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Abstract 

While inverse estimates in the context of radial basis function approximation on boundary-free do¬ 
mains have been known for at least ten years, such theorems for the more important and difficult setting 
of bounded domains have been notably absent. This article develops inverse estimates for finite dimen¬ 
sional spaces arising in radial basis function approximation and meshless methods. The inverse estimates 
we consider control Sobolev norms of linear combinations of a localized basis by the L p norm over a 
bounded domain. The localized basis is generated by forming local Lagrange functions for certain types 
of RBFs (namely Matern and surface spline RBFs). In this way it extends the boundary-free construction 
recently presented in 0. 


1 Introduction 

This article presents a construction for localized bases generated by radial basis functions (RBFs) in the 
presence of a boundary and develops analytic properties of this basis, most notably inverse inequalities. 
Such inequalities are an essential tool in the numerical solution of PDEs by finite element and related 
methods (see ) notably in proving inf-sup (Babuska-Brezzi) conditions, which play a central role 

for mixed element and saddle point problems mmiii]. They are also prevalent in approximation 
theory (where they are called “Bernstein inequalities”); specifically they are used to obtain characterization 
of approximation spaces as interpolation spaces by way of Tv-functionals [1]. 

The type of localized basis investigated in this article has been introduced very recently for the boundary- 
free setting (e.g., on a manifold without boundary) and has already been employed to deliver strong results 
in function approximation and scattered data fitting [S] , numerical quadrature [9] and solution of PDEs [251 
and integral equations [20]. Indeed, in [20], Lehoucq and Rowe have applied the localized basis investigated 
in this article to obtain a Galerkin solution to a constrained integral equation, and they have used the L p 
stability of the basis (presented in this paper in Section 4) to obtain norm bounds on the stiffness matrix 
associated with this problem. 

The inverse estimates we consider treat finite dimensional spaces of functions, bounding strong (Sobolev) 
norms by weak (Lebesgue) norms: 

IMIw-(fi) < Ch- a \\s\\ Lp{n) (or ||s||c-(fi) < C'/r"' T ||s|| ioo(n) for p = oo), (1.1) 
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where 17 is a bounded subset in R d , subject to mild conditions on <917 and h is the fill distance (also known 
as mesh ratio) of the finite set of points used to generate our finite dimensional space (see Section |2.1| for a 
precise description). In one sense, these estimates can be viewed as providing an operator norm bound (from 
L p — > L p ) of differential operators restricted to this finite dimensional space. In another sense, they give 
precise equivalences between different norms in terms of a simple measure of the complexity (given by the 
parameter N above) of the finite dimensional space. Direct consequences of these inverse estimates include 
trace estimates and Bernstein-Nikolskii inequalities. 

This topic has been considered in the boundary-free setting by a number of authors, we list [28], [23], 
I32j . |12j (although there are certainly others). The inequalities we consider here are similar, but depend 
only on the norm of a basic function over a bounded regiorf] Without a doubt this type of estimate is 
significantly more challenging when a boundary is present and has, to the best of our knowledge, remained 
elusive. Indeed, such inverse inequalities seem to have been absent for meshless methods in general (not only 
radial basis function approximation, cf. the discussion in [22) Section 7]). 

In this article we consider two prominent families of radial basis functions: the Matern (or Whittle- 
Matern) and surface spline kernels. Generalizations to other kernels and other settings (namely, com¬ 
pact Riemannian manifolds) are fairly straightforward, but complicated. They have been considered in the 
manuscript m 

The conventional finite dimensional space associated with a positive definite RBF (f> and a finite set 
X C R d has the form S(X) = span— ??); for a conditionally positive definite RBF, S(X) involves 
polynomials; see Section [2. 5. 2| A common set-up for a host of numerical problems invites the user to employ 
the basis of sampled kernels </>(• — 77 ), 77 £ X as one would use polynomials, splines, finite elements, etc.: that 
is to say as test functions for Galerkin or collocation methods, or as basis functions to solve interpolation, 
quadrature or other basic problems. 

For a basic interpolation problem, using S(X) to interpolate data sampled at the point set X, the ensuing 
interpolation matrix will be positive definite, thanks to the kernel’s positive definiteness, but if A' is sampled 
densely, the interpolation matrix will become dense|^] 

Instead of using the basis of kernels, one may attempt to use another basis for S(X); one for which basic 
matrices (Gram, collocation, stiffness, interpolation) exhibit off-diagonal decay. Univariate splines provide 
a prime example of this phenomenon: for a fixed k, the shifted truncated powers (x — tj) 1 —> provide, in 
conjunction with polynomials of degree k or less, a basis for the spline space with breakpoints at tj, but this 
basis is known to be poorly localized. However, the R-spline basis is well-localized, with elements having 
support which is not only compactly supported, but stationary in the sense that it shrinks with the spacing 
of the breakpoints. 

We are concerned with an analogous localization problem for radial basis functions: 

Is there a basis for S(X) where the various elements exhibit a fast rate of stationary decay? 

If an alternative basis is available for which the interpolation matrices are sparse, we say the basis is well- 
localized. For the Matern and surface-spline kernels, the Lagrange function \r] is well localized in a neigh¬ 
borhood of where the points from X are distributed quasi-uniformly. If this is not the case, for instance if 
77 occurs near to the boundary of the convex hull of X, localization is lost. 

This issue can be circumvented by using only the Lagrange basis elements that have centers £ in a 
sufficiently large subset 3 C A', where 3 is chosen so that the Lagrange functions y^, ( £ 3, are localized. 
Using these elements we may define Vs := span^ gH yj, which is of course a subspace of S( A). To avoid a 
possible point of confusion, we emphasize that Vs ^ >5(3). The former space requires all basis functions 
centered in A for its construction, the latter only those in 5. 

After this initial streamlining, it is important to note that even though y^, £ € 3, is spatially localized, 
its construction still requires all of the points in A. Thus finding the y^’s is computationally expensive. In 

1 A previous result in the setting of a bounded region was presented in m, but these estimates significantly undershoot the 
precise exponent —a in 

2 One may attempt to circumvent this problem by dilating the kernel; this is often done, but will generally result in degraded 
rates of approximation. 
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[5] , local Lagrange functions { 6 ^}j e s were introduced. Constructing them is done by first choosing points 
Y(£) C X in a small neighborhood of £ £ S, and then finding the Lagrange function £ S( T(£)) C S(X). 
Since Y(£) will contain many fewer points then X, it will be much less expensive to find b$. Finally, we 
define Vs = spam gS 6 c, which is a subspace of S(X). We remark that Xf 7 ^ ^ and Vs 7 ^ Vs■ However, they 
are close - a fact that will prove important in the sequel. 

We now turn to the connection between the set H and the spaces described above. At the start, we are 
given a quasi-uniform set S £ fl. The enlarged set X is not given. Rather, an extension is constructed 
from S, using a method - describedjn Section [273| that preserves the key geometric properties of 5. The 
extension, which will be denoted by S later (instead of X), is contained in a bounded region H that contains 
Hand is roughly speaking about twi ce th e size of SI. It is for this setup that we get estimates of the form 


( 1 . 1 ) for s £ Vs or Vs . (See Theorem 5.1 ) 


1.1 Overview and Outline 

We begin by giving basic explanation and background on RBFs used in this article. This is done in Section 
2 . 

In Section 3, we introduce the Lagrange basis (the functions generating the space Vs) and provide 
estimates that control the Sobolev norm (i.e., W p (Q)) of a function in Vs by the ^ p norm on the Lagrange 
coefficients and in addition by the L p norm of s. That is, for s = a ?X£ we show 


IMIw/Cn) < C(#S)^-^||(a 4 ) ? 6 s|| M E) and \\s\\ w?m < C'(#S)-^|| S || ip(f2) . 

Such a result has not appeared previously. 

Section 4 introduces the other stable basis considered in this paper: the local Lagrange basis, which 
generates the space Vs- We give sufficient conditions to prove existence and stability of such a basis. We 
give estimates that control the Sobolev norm (i.e., W p (Q)) of a function in Vs by the £ p norm on the local 
Lagrange coefficients and by the L p norm of the function. This result is presented in Theorem 4.10 Next we 


compare the sequence norm with the L p norm of an expansion s = a £X£ £ Vs or s = a £X£ €= Vs 

over the domain fi. We thus obtain 

||(a£) |e E|| 4 (E)~C(#S)- 1 /P|| S || Lp(a) . 

In the final section we give our main inverse estimates. For s £ Vs we have 

IMIw^fi) < C(#E)~ a/d \\s\\ Lp ^, 

and we use this to demonstrate trace estimates for that space. 


2 Background: RBF approximation on bounded domains 

We begin by describing the basic elements used in this article, starting with geometric properties of point 
sets, a discussion of the the underlying domain, smoothness spaces on the domain, and finishing with some 
background about the radial basis functions which we use. 

2.1 Point sets 

Given a set D C and a discrete, possibly infinite, set Icfl, we define its fill distance (or mesh norm) 
h, the separation radius q and the mesh ratio p to be: 

h(X,D) := sup dist(.T, X), q{X) := ]- inf dist(£, A' \ {£}), p(X,D) := ^ , (2.1) 
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where in defining p(X, D) we assume that q(X) > 0. 

When there is no chance of confusion, we drop dependence in these parameters on X and D (referring 
simply to h,q and p). 

Remark 2.1. A finite fill distance h guarantees that the set D is covered by the family of balls B((,h) := 
{x £ D | dist(x,£) < h}, ( £ X. A positive separation radius q guarantees that B(t;,q) n B((,q) = {} 
for distinct (,( £ H. The mesh ratio, which automatically satisfies p > 1, measures the uniformity of the 
distribution of X in D. The larger p(X,D) is, the less uniform the distribution is. If p is finite, then we say 
that the point set X is quasi-uniformly distributed (in D), or simply that X is quasi-uniform. 

Note that, for a compact subset D and a nonempty, finite subset X C D, the fill distance and separation 
radius are both positive and finite 0 < q < h < oo. Consequently, p is finite, too. 

Many of the results in this article depend in some way on the geometry of the point set X - often this 
emerges in an estimate, where a constant depends on p. In most cases, (as one may expect) the strength of 
the estimate degrades as p increases. Throughout the paper, we have attempted make this control explicit, 
by factoring, whenever possible, the constant into a part which is totally independent of the point set, and 
another, which is a function of p. 

It is often useful to estimate certain sums over X. Assume that q(X) >0. If / : [0, oo) —> [0, oo) is a 
positive, decreasing, continuous function, then 

OO 

/( dist (C,£» < /(0) +C , J2n d ~ 1 f{nq) (2.2) 

Cex n=i 


where C depends only on the spatial dimension d. This is easily established by introducing annuli centered 
at £, with inside radius nq and outside radius ( n + 1 )q, n > 1. The number of points contained in each 
annulus is proportional to n^ 1 , and the contribution to the sum from each n, n > 1 , is less than f(nq). 
Hence, (12.21) holds. 


2.2 The domain 

We now consider a bounded region HcK 1 * containing a finite point set S with h = /i(£, fi) and q = g(S) as 
defined above. This presents two challenges. 

The first concerns S - although we may expect it to be finely sampled (often referred to as sufficiently 
dense , meaning that /i(S, fl) is small) in H, it will not be so in a neighborhood of H. To construct the 
localized bases to be used in the sequel, we need a larger set X C R d so that X n fl = 3. In other words, 
we require some extra points to lie outside of H (in fact, when working with local Lagrange functions b £, it 
suffices to consider only a very small extension T C {x £ R d | dist(s, fl) < Kh\ log /i|}). This assumption 
is in place to guarantee decay of the basis functions - in other words, it is only a tool for guaranteeing the 
decay of x$, or and is not otherwise important for the stability estimate. It would be quite reasonable to 
be ‘given’ initially only the set ScH and to use this to construct X . In Lemma [2~2] below we demonstrate 
how to extend a given set of centers £ C II in a controlled way to obtain a satisfactory set X. 

The second challenge concerns the domain fl. For estimates relating ||a||^ and the L p norm of expansions 
|| ]T)c a^&dl or || ajX'dl the boundary becomes more important. The extra assumption we make on fl, in 
force throughout the article, is that fl satisfies an interior cone condition (see Appendix[A]for a discussion). 


2.3 Extending points 

Given H and S C H, we wish to find an extension £ e xt £> 2 dense in so that the separation radius does not 
decrease and the fill distance is controlled (and, consequently, p does not increase). A simple constructive 
example is the following. 

Lemma 2.2. Suppose E C Cl has fill distance (i(S, f2) = h and separation radius q( S) = q. Then there is a 
discrete set S ext so that S ext n H = S, g(S ext ) = q, and /i(£ ext , R d ) = h(Vd/2 + 2). 
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Proof. We proceed as follows: let S ext = SU{( S hZ d | dist(C,f2) > h}. We note that h(hZ d ,M. d ) = ^h 
and q(hZ d ) = h. It follows immediately that q(E ext ) = q. If x £ M. d is within + 1 )h of f2, then 
dist(£, 2) < + 2 )h. On the other hand, if x £ satisfies dist(a;, O) > + 1 )h then there is ^ £ KL d 

with dist(a;,£) < so that dist(£, O) > h (and £ is therefore in 5 ext ). □ 



Figure 1: This image shows the domain Q (the region inside the cardioid) with a set of points S C O 
indicated with •. The dotted line segment indicates h - the greatest distance between a point of Q and one 
of 5. The_solid line segment indicates 2 q, the nearest neighbor distance in 2. The elements in the extended 
point set H \ S are denoted with a square - these are the centers used to construct (discussed in Section 
3). The points ■ denote the points of T, which are used to construct b^ (this is done in Section 4). 


Remark 2.3. We note that other extensions exist which do not increase h. For example, m Lemma 5.1] 
extends points so that h(E ext ,M. d ) = h and q(]E ext ) = mm(q,h/2). As an expository convenience, we use an 
extension S ext of S to which does not increase h. In practice, an extension could be used which might 
not precisely preserve the geometry of the point set (such as the elementary one in Lemma 2.2). This will 
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not change the results in this paper, other than by modifying slightly the constants. We leave it to the reader 
to make the (very simple) modifications necessary to treat other extensions (which would increase h and p). 

We construct the extended point set in an extended neighborhood 

S := S ext n Cl where Cl := {x G | dist(x, fl) < diam(fi)} (2.3) 

and where S ext is constructed according to the method of Remark |2.3[ 


2.4 Smoothness spaces on 

In order to present a suitably robust family of inverse estimates, we employ a scale of spaces depending on 
a positive, occasionally fractional, smoothness parameter; as in pQ, for integer values of this parameter, we 
use the conventional Sobolev spaces, while for fractional values we use fractional spaces, which involve a 
Holder-like seminorm. 

For a domain !)cl d , the integer order Sobolev space is defined in the conventional way. For 1 < p < oo 
and m £ N, we have the semi-norm and norm 


I W™(n) H 

| Ct | = 77 ? 


\D a u(x)\ p dx, 


i p 


n) H 

k —0 


'wMiiy 


Note that for the first expression (the Sobolev semi-norm), we use the binomial coefficient with multi-integers 


= ai , m! Qrf! while for the second we use a standard binomial coefficient = k\(m-k)\ • Although 

other weights would give equivalent norms, resulting in the same Sobolev spaces equipped with the same 
topology, these choices of coefficients will be necessary to obtain the specific reproducing kernels we desire 
(see Section 2.5.1 and 2.5.21. 

For fractional orders a = m + S ^ N with 0 < S < 1 we add the Slobodeckij semi-norm 

\D a u(x) -D a u(y)\ p 


I W-(Q) H 

| a |=rr; 


Q JQ 


\x - y\ d +P s 


■ dx’dy, 


IHIwwn) IMIwwn) 




We note that when o = m + S is fractionaFl W° (fl) is the Besov space B^ p (Q) (this is [3 Theorem 6.7]). In 
particular, W p (fl) = B p (Cl) = [W™(Cl),W™ +1 (d)\ 5 tP serves as the [<5,p] (real) interpolation space between 
IF ” 1 (O) and IF™ + 1 (H) (see [31], 1.6.2] for a definition and basic results). 

Of particular importance is the fact that, for 2 < p < oo and m £ N, we have the continuous embedding 
W 2 m (fi) C W p (fl) for all s <m — (d/2 — d/p). 

Throughout the paper, we make the (not unusual) modification W™(f2) = C m (Cl) when p = oo and 
to € N. For fractional order spaces when p = oo (discussed in Section 5), we use the Holder space C s (fl), 
for which max| a | = ^j sup^, is finite for S = s — [_sj- In this case, IF]”(fl) C W p (Cl) for 

all s < m — d/2. 


2.4.1 Scaling and fractional Sobolev spaces 

For an open set O C let us introduce the notation Or := {a: | x/R € O}. The following lemma shows 
how the fractional Sobolev seminorm scales with R. 

Lemma 2.4. Suppose 1 < p < oo, s € [0, oo) and u £ W p (Op/). Let U : O —> C : x u(Rx). Then 

\ u \w°(o R ) = CR d/p ~ s \U\ W s(o) 

3 When a = in is an integer, we have = B™ 2 WL), although SI) C for p > 2 and W™( O) O for 

p < 2. 
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Proof. We consider the case 1 < p < oo and s = k + 5, 0 < <5 < 1, since the cases where s is an integer and 
p = oo follow similarly, but are much easier. For RX = x, the chain rule gives us D a u(x) = R~^U(X) and 



\D a u(x) -D a u(y)\ p 
\x - y\d+pS 


dydx 


j^d—pS—pk 



\D a U(X) — D a U(Y)\ p 
\X - Y\ d +P s 


dYdX 


R d ~ ps \U\ 


w°(oy 


□ 


2.4.2 Sub-additivity and fractional Sobolev spaces 

Carstensen and Faermann [2] have pointed out that the pth power of the fractional Sobolev semi¬ 

norm fails to be sub-additive. This is in contrast to the (pth power) integral order seminorms, which are 
obtained from integrals of non-negative functions, and are easily seen to be sub-additive. 

The following lemma is a modification of a result of Faermann m Lemma 3.1]) which we use as a tool 
to treat the issue of non-subadditivity. This will be used in the sequel. 

Lemma 2.5. Suppose V = {vj \ j £ Af} is a countable family of subsets vj C SI covering S7 with finite 
overlap: i.e., SI C UjcAfTi anc ^ there is M > 0 so that max^gn SjeA/ - ( x ) < Af. Suppose further that 
there exist sets Vj D Vj so that the complements Wj := S7 \ Vj each are a fixed positive distance from the 
corresponding sets vy. i.e., there is H > 0 so that for every j £ Af, inf^gj. yew . \x — y\ > H. 

Let 1 < p < oo and s £ (0, oo) \ N with k = |_sj and S = s — |_sj. Then for any u £ W p (SI) we have 


lws(n) 




Vi e a r 




+ CMH- pS \\u\\ p wk(n) 


(2.4) 


Proof. By sub-additivity of the outer integral, we have that 

\D a u(x) -D a u(y)\ p 


- XX 

H =kjeN Jv i 

< XX 

\a\=kj&X 


n I* - y\ d+pS 

| D a u{x) - D a u{y)\ p 
\ X - y\ d +P 5 


dyda; 
dydx 


| D a u{x) - D a u(y)\ p 
\x - y\ d +P s 


dydx . 


The first terms are controlled by ^)=i MuWu )’ s i nce Vj C Vj, and so this gives the first part of the right 
hand side of (2.41. 

Consider the sum of the second terms. Applying the quasi-triangle inequality (a + b) p < C p (a p + b p ) to 
the numerator \D a u(x) — D a u(y)\ p , we obtain 


\D a u(x) -D a u(y)\ p 
|a; - y\ d +P s 


dydx < C p 


u 3 w 3 

=: Jj,i + Jj,2- 


\D a u(x)\ p 
\ X - y\ d +P s 


dydx - 


\D a u(y)\ p 

|x - y\ d +P s 


dxdy 


We have that (Jjejvfe xw j) C {(x, y) £ SI 2 | |x — y\ > H}. By symmetry, we have also that UjeA/’( u, J xc 
{(x,y) € SI 2 | |x — z/| > H}. Using the finite overlap, we have that 


ThjeN A[Cj xwy 


< tfX{( j! 9 ) e n^\\ x _ y \ >H }- 
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Consequently, for any non-negative, integrable g : fl x fl —y K, we have 


^jeMLf Wj 9(x,y)dxdy^ r 

r r ( — M 9( x ,y) 

Eje N J Wj J dj a(x, y)dxdy J J {{x, y )^\\x- v \>H} 

as well. Setting g(x,y) = “.d+pi and applying Fubini - Tonelli allows us to control E+ Jj, 2 ) by 


y. (jj, 1+^,2) < 2 M 

j'eAA 


|D q m(x)| p 


/ , < CMH~ pS [ |£> a it(x)| p dx. 

/{(rc,i/)en 2 ||x-i/|>ff} \ x — y\ d+p J n 

(For the last estimate, we have used the fact that f\ x ^ y \ >H jy^p+^dy < C p sH~ pd for all x.) 


□ 


2.5 Radial basis functions 


There are two families of radial basis functions considered in this article: the Matern functions and the 
surface splines. Both families (under the right conditions) admit exponentially decaying basis functions 
- this is mentioned in Section 3.1 They also admit rapidly constructed localized basis functions (having 
polynomial decay) - this is demonstrated in Sections 4.3.1 and 4.3.2 The results we present in Sections [5] 
hold for these families. 

Two features common to both families are: 


1. For any finite set of points 2 C the interpolation problem is well posed. This means that for any 
data (£,y£)fG~ 5 there exists a unique interpolant s generated by the RBF. 

2. The RBF is a reproducing kernel for a (semi-)Hilbert space, called the native space, and the unique 
interpolant to (CtV^)^gs is the best interpolant in this space: it has the least (semi-)norm among all 
interpolants to the data. 

We include both families (which are in some ways quite similar) because both are often in use, practically. 
The first is prized for the RBF’s rapid decay and strict positive definiteness; the second is included for its 
dilation invariance and its historical significance. Of course, there are many other prominent families of 
RBFs, each with its own distinguishing features (some are infinitely smooth, some are compactly supported, 
etc.). Rather than give a broad overview, we introduce the specific families employed in this paper and direct 
the interested reader to [33] for a comprehensive introduction to RBF theory. At this point it is unclear if 
the algorithm for constructing localized bases works for other families; the arguments we employ rely heavily 
on the RBF’s role as the fundamental solution to an elliptic partial differential operator. 


2.5.1 Matern kernels 

The Matern function of order m > d/2 is defined as 

-> R : x CK m _ d/2 {\x\) \x\ m ~ d / 2 . (2.5) 

Here C is a constant depending on m and d, and K„ is a Bessel function of the second kind. 

The Matern function is positive definite, which means that for every finite set X C R d , the collocation 
matrix 

Kx := (Km(( - C))?,cex 

is strictly positive definite. 

The guaranteed invertibility of Kx is of use in solving interpolation problems - given y £ , one finds 

a £ so that K^a = y. It follows that E^eA' a i K m(- — 0 is the unique interpolant to (£>2/£){ex in 
S(X) := span ^ eX K m (- - 0- 









It is the reproducing kernel for the Hilbert space Af(nm) = W™(R d ) equipped with the (standard) inner 
product 


{fi g)w™( R d ) 



where D 3 f is the tensor (i.e., the ^-dimensional array) of partial derivatives of order j. Being the reproducing 
kernel means simply that f(x) = (/, n m (x — •)}w 2 m (R' i ) f° r a ll £ £ and all / £ W™(R d ). It can be shown 
that among all functions interpolating the data (£, the interpolant Y2^^x a i K m{- — £) (i.e., where a 

is the solution of K_\'a = y) has the smallest H /r | rl (]R d ) norm. 


2.5.2 Surface splines 


For m > d/2, the surface spline is 


<j) m : —> R. : x C 


\x\ 2m ~ d d is odd 

\x\ 2rn ~ d log \x\ d is even. 


( 2 . 6 ) 


The surface spline of order m is conditionally positive definite (CPD) with respect to n m _i, the space 
of polynomials of degree m — 1. This means that for every finite set X C R d , the quadratic form M A —► R. : 
a i ^ (a, K^a) = Y2$ex Efex — C) a £ a C is positive for all nonzero a £ R A satisfying Y2$ex a iV{Y) = 0 
for all p £ (In other words, it is positive definite on a subspace of R A of finite codimension (namely, 

the annihilator of H m -i |x)- 

One may solve interpolation problems using the finite dimensional space 


S(X) := 


X&x 


a£<f>m(- ~ Q | a$p(0 = 0 for all p £ T\ m -i 


+ n m _i 


provided that data sites X are unisolvent: i.e., so that if p £ n m „i satisfies p(£) = 0 for all £ £ X 
then p = 0. Let {p\,... ,pn} be a basis for and construct the f/X x N Vandermonde matrix <I> = 

{Pj(0)iex,j=i,...,N- For data y £ R A one finds a £ R A and c £ I w so that 

K Y fa\ _ (y 
$ T Oj ^0 

It follows that Sx '■= Yl^^x — 0 + Y2f= i c jPj is the unique interpolant to {f,y^)^ex in S(X). 

The surface spline (fi m is the reproducing kernel for the semi-Hilbert space 


D~ m L 2 = {/ £ C(R d ) | V|a| = to, D a f £ L 2 (R d )} 


(sometimes called the Beppo-Levi space), which is a semi-Hilbert space (a vector space having a semi-definite 
inner product with nullspace n m _ ls so that D~ m L 2 /H m _ 1 is a Hilbert space). The space D~ m L 2 is endowed 
with the semi-definite product 

{f,9 )D~™l 2 = (f,g)w™(R d ) = f (D m f(x),D m g{x))dx = ^ (™'\ [ D p f{x)D p g(x)dx. 

jR d p =m \P / JR d 


Although (j) m ^ D~ m L 2 (its ruth derivatives behave, roughly, like 0{\x\ m ~ d ), which is not square inte- 
grable, since 2 to > d), with a little effort, one may show that the spaces S(X) are contained in D~ m L 2 . The 
RBF (f> m is its reproducing kernel in the sense that for X C R d and two functions /i ,/2 £ D~ m L 2 where 
f 2 has the form f 2 = «£</>(■ -Q+P€ s ( x ) we have {h, h) d-^l?. = a «/i(£)- Th e interested 

reader will find a material on surface splines and conditionally positive definite RBFs in [331 Chapter 8 ]. 

As in the case of Matern kernels, the unique interpolant residing in S(X) has the smallest D~ m L 2 
semi-norm among all interpolants to the data (f-,y^)^x- 
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2.5.3 Labeling kernels 

In most cases in this article, the Matern and surface spline RBFs exhibit similar behaviors. Because our 
results often depend only on a single parameter m indexing the RBF, we use the abbreviated notation k m 
to stand for either n m or 4> m . 

In both cases, the function k m has L p smoothness less than 2m — d + d/p (i.e., for any bounded set If, 
k m £ Wp (0) for all a < 2m — d + d/p). It follows that any finite linear combination of shifts of k m has the 
same regularity. Denote the space of such linear combinations as 


|span<r eJs: ft m (- - £) k m = 

- 0 I Sc a tP(€) = 0 f° r a11 P £ n m _i| +n m _i km = 4>n 


Then we have 

S(X) C Wp (f2) for all a < 2m — d( 1 — 1/p). 

Likewise, we let A f(k m ) represent either of the two native spaces: IT™(R d ) or D~ m L 2 (S . d ). We note that 
both satisfy the continuous embedding IF™(R d ) C A f{k m ) C W™ loc (R d ). In this case, the functions in the 
native space have a lower L p regularity, with 


A f ( k m ) C 


w;( fi) 
c s (n) 


for s < m — (d/2 — d/p) + , 1 < p < oo, 

for s < m — d/2. 


3 Lagrange functions and first Bernstein inequalities 

In this section we investigate some further results about the RBF k m \ namely, we consider analytic properties 
of the Lagrange functions. These have been presented in T5j, but we explain them below for the sake of 
completeness. 

After this we give a first class of Bernstein estimates, valid for linear combinations of Lagrange functions. 


3.1 Lagrange functions 

For a finite X C R d , there exists a family of (uniquely defined) functions (xf)£ex satisfying %£ G S(X) 
and XiiC) = £(£, 0 f° r a ll ( £ X. We may take the N{k m ) inner product of two Lagrange functions 
XC e s i x )i noting that they have the form xt = J2 v ex A v,i k ™{-~V )+P and xc = E, ;G a A vX k rn(-~v)+P 
(in the case of Matern functions k m = K m , we have p = p = 0), to obtain 

(X6XcW m ) = (X|, Er,ex A vX k m(- - V) +P) Ar(k) = T, v ex A v,(Xdv) = A t,c- (3-1) 


Lagrange function coefficients We can make the following ‘bump estimate’ which uses a bump function 
V’S.g = : ~:► [0,1] that is compactly supported in B(£, q) and satisfies ip£,q(£) = 1 on a neighborhood 

of q. We have 

llxdlAf(fc m ) < IIV’£,gllA/'(fcm) < C||V’4,gllw 2 m (R‘ 1 ) < C? 2 m - (3-2) 

This follows because xc is the best interpolant to ( —>• £(£,£). As a consequence, Lagrange coefficients are 
uniformly bounded: 

l^,d = l<XC,XcW)l <Cq d - 2m . (3.3) 


Better decay For the kernels considered in this article, and more generally for the framework given in m 
and |T9j, to get desired estimates for Lagrange functions over a compact region ficl^ the interpolatory 
conditions must be satisfied on a point set thatjs suitably dense in a t fair ly large neighborhood of 0. To 
handle this, we use the quasi-uniform extension 
of V E . 


developed in Section 2.2 This brings us to the definition 


10 



Definition 3.1. For a compact set Q C R d and a finite subset S C 12, let 2 be the extension to {x £ M. d \ 
dist(x, fl) < diam(fi)} given in [2.3] in Section 2.3. Then for the system of Lagrange functions (xj)j g 5 

generated by k m over H, let 

:= span{x 5 | f. £ H}. 

In particular Vs C S(E). 

For £ in the original set 5, we have the improved estimates: 

llxdlw 2 m (® d \B(£,fl)) < Cq d/2 ~ m exp > for all 0 <R< dist(£, dfl). (3.4) 

This is demonstrated in Appendix [Aj specifically in Lemma s This leads to improved estimates. For 
(eS and all x £ fl 

(3.5) 

(3.6) 


Likewise, for (,(£2, 


\xd*)\ < C'p m - d/2 exp • 

\A iX \<Cq d - 2m exp^-o 1 -^-^ 


3.2 Stability of the Lagrange-function basis for on 0 

Recall that Vs = span{x^}^ £H , where 5 is a subset of all of the centers in 5. We begin by defining the 
operator T : C“ —i Vs by Ta = a ?X'f =: s - I 11 other words, T is the synthesis operator, which takes a 

set of coefficients {agl^s and outputs a function s £ Vs■ Because the basis in consideration is the Lagrange 
basis, the coefficients satisfy T a(£) = s(£) = for £ £ S and therefore, for the basis {x^}{es, the operator 
T is an interpolation operator. 

If we use the £ p (E) norm for C“ and L p (Q) for Vs, then the stability of the basis, relative to these 
norms, is measured by comparing ||a||^ (h) and ||s||l (q). We show this with the following proposition, which 
indicates that if s £ Vs is small (relative to L p (fl)) then its coefficients are small in £ P (E) (and likewise, if 
the coefficients of s are small in £ p (S), so too is the norm of s £ L p (Ll)). 

Proposition 3.2. (Lagrange Basis Stability) There is a constant ho = ho{m,d), so that for S C Cl 
satisfying h(E, f l) < ho and 1 < p < oo, we have constants C\ = Ci(m, p, ST) > 0 and c 2 = c 2 (m, fl) so that 

ci ||a||^ (H) < W d/p |IE^H a «xdU P (n) < c 2 p m+d / p ||a|| MH) . (3.7) 

We remark that the dependence in the lower constant c\ on p can be made explicit. This is worked out 
in Lemma IB.61 


Proof. The proof is given in 
lower bound. 


Appendix [Bj Lemma |B. 1 | provides the upper bound and Lemma B .6 gives the 

□ 


Another way to think of this inequality is as an L p (Q) Marcinkiewicz-Zygmund (MZ) inequality. Such 
inequalities are used to relate the L i norm of a trigonometric polynomial to the l\ norm of the polynomial 
evaluated on some fixed, finite set. MZ inequalities have also been developed for spherical polynomials on 
S d |23]. For spherical polynomials in § d , there is another type of inequality, a Nikolskii inequality. On § d , 
these have the form ||<S'||,l p < CL d ^~p' ) + ||5|| ir .(§d), for any degree L spherical polynomial. Our result below 
establishes such an inequality for Vs- 

Corollary 3.3. (Nikolskii Inequality) With the assumptions and notation of Proposition \3V\ and with 
1 < p, r < oo, we have that 

INImo) < (3.8) 

with C = C(m, p,Q,p,r). 


11 










Proof. Recall that, for a £ C“, ||a||< N^p ^ ||a||f r (E), where N = ffiE. Since N ~ q d , this inequality 
implies that ||a||^( H ) < Co,^ r ^q~ d ^p~^ + ||a||^ r ( S ). From this and (|3.7[), we thus have 


L„(fi) — Cp,m,n,p,r9 


d (|-(|-?) + ) I 


r (H) — Cp z rn,Q,p,rQ 




\L r {Q)- 


The result follows from the identity x — ( x)+ = — (— x) + . 


□ 


3.3 Bernstein type estimates for (full) Lagrange functions 

In this section we will provide a Bernstein (or inverse) theorem relating Sobolev norms of functions in Vs 
to the corresponding f. p norms on the coefficients. This is the key to controlling the Sobolev norm of the 
function in Vs by its L p (Cl) norm. 

Before proceeding, we first prove two lemmas. 


Lemma 3.4. Let 1 < p < oo, s > d/2 and 0 < a < s. Suppose O is a fixed open set with Lipschitz boundary 
and as before On = {# | x/R £ O}. Suppose further that Wf(O) is embedded continuously in W p (0) (where 
we take C a in case p = oo). Then there is a constant C depending on O, p, s and a so that if U £ W p (O ) 
and if the set X of zeros of U in O are sufficiently dense that h(X, O) < 1 and Lemma A.l applies then for 
u(x) = U(x/R), we have 

\ u \wz{o R ) < CR s ~ a+d/p ~ d/2 \u\ W s(o R ) 


Proof. Lemma 2.4 shows that \u\w”(o R ) = R d ^ p a \U\wp( 0 )- Because ||L/||(O) < C\\U\\w a ( 0 )i we have that 

1/2 


I w ? to R ) < CR d / p -°\\U\\ Wi{0) = CR d / p ~° (j2 k j=0 


WfiO) 


\ U \w£( 0 ) 


(in case s is fractional; 


we leave the necessary modification to the reader in case s £ N). We now apply the zeros estimate (27, 
Theorem 4.2] to each term on the right hand side, obtaining 

\ u \wz(o R ) = CR d / p ~ <T \U\ W s( 0 ), 


since |f7|^ (0) < Ch 2( ~ s j) \U\^. (0) for all j. Applying Lemma 
desired estimate. 


2.4 


again, (this time with p = 2 ), yields the 

□ 


Lemma 3.5. Suppose S is sufficiently dense (meaning h(E,Ll) < ho for a constant ho = ho{d,m) > OJ and 
r) £ S. Decompose 5 into disjoint annuli S = (J°l 0 Sj(? f): where Ej( rf) := {C £ S | 2 : >~ 1 h < dist(<^, 77 ) < 2^h} 
for j > 0 and S 0 (^) := {£ £ H | 0 < dist(C, 17 ) < h}. 

We have, for 2 < p < 00 and 0 < a < m — d/2 + d/p that 

^ (3-9) 

with C = C(p, a, m, d) and v = v(m, d). 

Proof. Repeatedly applying the quasi-triangle inequality ||a + 6|| p < 2 p_1 (||a|| p + ||fo|| p ) to this sum gives 

OO 

II X] a ?^llw^(B(r ? ,3? l )) - 2<J+ ^11 E a €X?( a; )|| W o'(B(r ) ,3/ l ))‘ 

C6H 2=0 HeZfir,) 

Observe that //Efig) < 0Jdp d 2 jd (where the constant Ud depends on d), so the generalization of the above 
quasi-triangle inequality || Xq=i a ill P — nP ~ 1 S/=i ll a 2 'll P gi yes 

OO 

IIE^ii^ ( b ( ^)) <E^+^w^r 1 E ki p iixdi^( bo, 3 ,))• (3-io) 

€e= j=o 
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For dist(£, rf) = R sufficiently large, we have the inclusion B(rj , 3 h) C Cl \ £?(£, R — 3 h). Applying the zeros 
lemma |261 Theorem 1.1] gives 


llxdl 


W"(B(n,3h)) 


< llxdl 


WZ(n\B(Z,R-3h)) 


< Ch m - a - ( - dl 2 ~ d /p)+ || X5 || 


W£ l (Q\B(t,R-3h)) ■ 


Applying the energy estimate (3.4) and noting that d/2 — d/p > 0 gives us 

llxdl< Ch m - a - {d / 2 - d/p) +q d/2 - m e- uR/h = Cp m - d / 2 h d/p - a e-' R/h . 


We note that for £ in the annular set S ? (r/), dist(£, 77 ) = R, > /i2 J 1 , so e vR / h < e 2 2J . Applying this to 
(|3.10[) gives 


5I a «xc| 


p 

W ? (B( v ,3h)) 


< Cuj p ~ x p p ( rn ~ d / 2 )+ d ( p - x ) 2J’( d+1 )(p~ 1 ) |a ? | p /d _pCT e _ ^ p2J 


i=o 




< (jpP( m + d / 2 )- d h d -P< 7 2 j(d+1 ^ p ~^e~^ p2J y |a ? | p 

1=0 SeHj(rj) 


(3.11) 


□ 


Note that when p = oo, we use only integer smoothness a = k £ Z and the standard space C k (tt) of k 
times integral functions over Q. 

Theorem 3.6. For a sufficiently dense set 5 (meaning h(S, 12) < for a constant ho = ho(d,m) > 0 ) we 
have, for 0 < a < m — ( d/2 — d/p)+ when 1 < p < oo (or cr £ N with 0 < a < m — d/2 if p = oo), 

vr» ( n; ^ Cp m + d / 2 + d / p h d / p -° ||a|| /p(a) (3.12) 


with C = C{p, a, m, d). 

Proof. This is handled in four cases: p = oo, 2<p<oo,p=l and 1 < p < 2. 


Case 1: p = oo If a £ Z, we simply need to bound max| a | =0 -max xe n |-D Q X{( :r )|oo- To do this, 

consider a point x £ ft and a ball B(x,r ) C 12 with r = h max(16?n 2 ,1/hi) and hi is the constant from the 
zeros lemma |A.1[ In this case, we use a Bramble-Hilbert argument involving the averaged Taylor polynomial 
Q m Xi °f degree m— 1 described in Brenner-Scott [I]. It follows from [HI (3.9)] that \\D a Q m X$\\L ao (B(x,r )) < 
Cr vn —\ ol \- d / 2 \xt\wg l {B(x,r))- Likewise, we can estimate \\D a (x^ - Q m Xe,)\\L x ,{B(x,r)) by first using Lemma 
(with U((y — £)/r)) = Xz(y) ~ Q m Xt(y) and the embedding W^ l ~^ a \B( 0,1)) C C(B( 0,1)) to obtain 

l(X| - Q m Xt)\c^mx,r)) < Cr-l Q l|C/| c ,«i( B( o,i)) < Cr-W\U\ w ™ m op)) 

Rescaling, gives the estimate 


2.4 


\\D a {xi-Q m xd\\L x ( B{x ,r))<Cr-W 



Each seminorm in this last expression can be estimated by the Bramble-Hilbert Lemma, allowing us to 
bound the above by Cr m ~^~ d / 2 \x^\w^(B(x,r))- Together with the estimate on D^Q m x^, and recalling 
that r = Kh, we have 


\D a xdx)\ < P“Q m XdUoc(B(x,r)) + ||^“(XS - O m X«)IUoo( 5 (x,r)) < Ch m -^- d / 2 \ Xi \w-(B(x,r)). 
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From here, we apply the energy estimate (3.4) to obtain 


d d 


D a xt(x )I < C/i m - |Q| - 2 g 2 - m e-"^ 


(3.13) 


The sum over S can be carried out over annular regions Zj(x) = {£ £ 3 | jh < dist(£,x) < (j + l)/i}. 
This leaves 


Y Y \D a xd x )\< c P m ~^ h ~ M Yj E <Cp m -2h-^Yp d U + ^) d e~ vj <Cp m+ 2h-^. 

j=OteSjix) 3 = 0£eHj(x) 3=0 


In the last inequality, we use the fact that the sum Yl'jLoU + 1) d e ^ = C depends on d and m (but not p). 


Case 2: 2 < p < oo We treat this case in two stages. At first, we consider a = k £ N, treating fractional 
Sobolev exponents for later. 


Case 2i: er = k £ N. By subadditivity, the Sobolev norm may be taken over overlapping balls 

II E a «*« \\w}m ^ EIIE a iXz( x )\\ W k( B ( rlih )y 

jes pen 

Applying Lemma [3~5] to the norm over each ball B(r],h ) C B(r),3h) gives 

OO 

IIE“exellE(O) ^ Cff m+ ^ 2 - d h d ~^ Y 2 j(d+1)(p - 1) e-^ _1 E E Kl p 

P 3 =o rieB^s^n) 

We may exchange summation between £ and 77 , noting that 77 £ Sj(£) iff £ G Ej(r/). This implies the estimate 
E„eE KI P = EtesEveB,® l°d P < “dP d 2 jd £ €6S l a d P - Consequently, 

OO \ 

| af |P. (3.14) 

3=0 ' £en 

The result follows by summing the series and taking the pth root. 


IE 


a «X€ 


ip 

I wi 


(n) 


< Cu} p d ~ 1 p p{m+d/2) h~ pk+d (^ 


Case 2ii: a ^ N. Let a = k + 6 with 0 < 6 < 1, and employ Lemma [2.5| using the neighborhoods 
{ 11 ( 77 , h) fl If | 77 <E S} as {vj | j £ N} and { B(r/,3h ) D fl | 77 £ S} for {vj \ j £ AT}. Note that for this 
choice of cover, M < Cp d . Indeed, for any x £ f2, enumerate the centers {£ £ 3 | dist(£,x) < h} as 
£ 1 , • • ■, £n- Then x £ B(£j,h) for each j = 1... n. Because the balls B(^j,q) are disjoint, it follows that 
n(cd,iq d ) = volflJ" =1 B(xj,q) fl B{x , h)) < Cd,ih d , so n < Cd(h/q) d . (Here Cd^h d is the volume of the ball of 
radius h , and Cd,iq d is the volume of the portion in B(x,h) of any ball which is centered in B(x,h).) This 
guarantees that 


E^xdknn) ^ EII E^dlw^p^)) + °p h 11 E a «X|||wfc(n)- 

pGH fes «ge 


We apply (3.14) to bound the second term, which gives 


Cp d h~^\\Y a ^ llw*(n) ^ C^ m+d ^+ d h-P k+d ~ s \a^. 


The first term is handled precisely as the integer case er = fc, which has been discussed above, leaving 


E II E 3»» S ||a|| 4(H| . 

pGH £ 6 E 
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Case 3: p = 1 We again consider the proof in two steps, first for the case of integer smoothness, where 
the Sobolev norm is sub-additive on sets, and then in the fractional case, where we can apply Lemma 2.5| 


As an initial simplification, note that the triangle inequality gives ||s||wf (fi) < || a IUi(H)( m ax 5 6 H I|X£llwf (fi))> 
so we need only to consider the size of HXfllwf (fi)- 

Case 3i: er = k € N. We proceed, as in Case 2, by first considering a = k S N and using subadditivity 
of the norm. For any integer K, we have Hxdlw^n) < llx« \\w*(B(z,Kh)) + llx^llwf (n\B(£,*rh))- 

The first term satisfies \\xz\\wf(B(z,Kh)) < Ud(Kh) d/ 2 \\Xt\\w$(B(£,Kh)y For K sufficiently large (a con¬ 
stant depending only on d), the zeros estimate [27, Theorem 4.2] gives 


llxdl wf(B((,Kh)) < MKh) d/2 h m ~ k ||xdlw r( n) < CK d ' 2 p m - d / 2 h d - k 


(3.15) 


The second term may be controlled by decomposing D \ B (£, Kh ) = \J^ K At en annuli (i.e., by taking 
At := {x G fi | ih < dist(:r,£) < (£ + l)/i}). Subadditivity gives 


llxdlwf 


(f2 \B(£,Kh)) < 


< 


< 


y, iixsiiwf^) 

e=K 

OO 

^(vol(A,)) 1/2 || x? ||^(^) 

c=k 

OO 

yc((i + i)h) d / 2 h m -°\\xz\\wr(A t ) 


i=K 


In the final line we have applied the zeros estimate (and simultaneously estimated the volume of the annulus 
At). At this point, we can apply the energy estimate (3.4) to obtain 

OO 

llxdkwK,*/,)) < E C W + l)h) d ' 2 h m -° q d ' 2 - m e- ve < Cp m - d ' 2 h d ~”. (3.16) 


e=K 


Combining (3.16) with (3.151, gives the desired result for a = k € N. 


Case 3ii: a ^ N. To handle the fractional case a = k + 5, we apply Lemma |2.5[ with an initial 
decomposition v\ = B(£,Kh), u 2 = fi\ £?(£, Kh), v\ = B(£, (K + l)h) and V 2 = tl\B(£,(K —l)h). Observe 
that these are disjoint, so the overlap constant is M = 1. Thus we have 


2.5 


a second 


Ixdwqqn) < Ixdwf(B(£,(if+i)/i)) + lxdw 7 (n\B(£,(.Fs:-i)h)) + Ch Hxsllw^n)- 

We can further decompose the middle term Ixflwf (n\s(£,(Jf-i)h)) en annuli by applying Lemma 
time. This time, we let vt := {x £ fi | 2 ' l (K — 1 )h < dist(ir, £) < 2 e+1 (I\ — 1 )h} for £ = 0,1,.... The annuli 
{vt | t £ N} partition fi \ B (£, (K — 1 )h), so the overlap constant M is M = 1; in fact, we need only the first 
£o = 1 + log 2 (diam(fi)/(/X — 1 )h) annuli. 

Define the neighborhoods of vt as Vt := {x £ fi | 2 e ~ 1 (K — 1 )h < dist(a:, £) < 2 e+2 (K — 1 )h}. The sets 


wt = fi \ vt satisfy dist(i)^, wt) > \{K — 1 )h > h. Lemma 2.5 shows that 


\Xn\w?(Q\B(i,(K-l)h)) < IE Ixd Wf(vt) I +Ch i ||xdlw 1 fc (n)- 


4 In this case, because 0 < a < m, one could just as easily adopt the strategy of proving the result for the extrema a = 0 
and g = m, and then using interpolation of operators to bound the synthesis operator T : ^i(S) —¥ Wf (Q), noting that W°(Q) 
is the (cr/m, 1) interpolation space between Li(fl) and W j rn (f2). 
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Since ||xdlwf(fi) = llx« II wf(n) + Ixdwf(fi) and h < 1, this leaves 

llxdlwr(n) < Ixdwf (B(£,(K+i)h)) + y Ixdwrfa) ) +C7* 5 ||X€II(fi)> 


(3.17) 


e=o 


which we must estimate. 

Estimating the third term in (3.1 7| ): The final term is easiest to control: Case 3i gives the estimate 

h~ S \\Xt\\wm < Cp m - d ' 2 h d - k - 5 . (3.18) 

Estimating the first term in \3.1 7[ ): The first term in ( |3.17| is controlled in a similar way to ( |3.15| ). Begin 
by setting R. := ( K + 1 )h, u = x$(- — fi) and U = u(R(-)). Applying Lemma [ih4| with O = B(0,1) gives 

Ixdwf (B(Z,(K+l)h)) = \u\w?(B(0,R) < CR d/2+m ~ a \u\ W ^(B{0,R)) = Ch d/2+Tn ~ a \xt\ W ™(B(Z,(K+l)h)) 

The bump estimate (3.21 then gives 


\Xi\wnB(UK+i)h)) < Ch d ' 2+m -° Ixdwjqn) < Ch d / 2+m -°q d ' 2 - m < Cp m - d ' 2 h d -°. 


(3.19) 


Estimating the middle term in Jff.l7| ): To handle the series appearing in ( |3.17 ), we proceed as in the last 
cmn-dr^Vi dT'.r'.Urin rr fnr- ocrnln D T.omTma ‘A A 11 — f? — ( T< — 1 \h aT~ir] TT — lit T? Tn 


paragraph, applying, for each £, Lemma 3.4 , now with u = xj(- — £), R = 2 r+z (K — 1 )h and U = u(R-). In 


this case O = B( 0,1) \ B(0,1/8). The scaling lemma uses the embedding Wif(O) C W[(0) which incurs an 
embedding constant C which is independent of L 

Ixd^TK) = Hwnou) < CR d / 2 + m -”\u\ w ? {OR) = C(2 e + 2 (K- l)) dl2+m -°h d / 2 + m -°\\ Xi \\ w ™ (ve) . 

Since v e is contained in fi\S(^,2^- 1 (A:-l)/i), we have |x«l Wf(v t ) < C2 ^ d,2+m ~° )hd/2+m ~°\\xd W ™{si\B(v e y 
Now we apply the energy estimate (3.4) which gives 

IxdwTM < C2 eW2+m - a) h d/2+m - a q d/2 - rn e-^ K - 1)2( - 1 . 

Observing that the infinite series Yf(f 0 {2 e< ' d ^ 2+m ~ a ' ) e~ ,l( ' K ~ 1 ' ,2e 1 converges to a constant depending only on 
d and m, we can bound the middle term: 

(£ Ixd^M^ < Ch m+d ' 2 -°q d ' 2 ~ m = Cp m - d ' 2 h d -°. 


(3.20) 


The case p = 1 follows from the estimates (3.19), (3.20), (3.18) and (3.17). 


Case 4: 1 < p < 2 In this case, we use Riesz-Thorin to estimate the norm of the operator T : £ p (H) —► 
Wp(fl), where T is the synthesis operator T a = E{eH°fX£ (he., the linear map which takes coordinate 
space C“ into the vector space Vs). Letting 9 = 2(h — i) (so that h = 91 + (1 — 9)\) gives 


(Cp- d/2 h d -°) 6 (Cp m - d/2 h d/2 - a y 6 ||a||^ ( H) 


I a ^xdl w-(O) < 

< Cp m - d l 2 h d l*- a \ !a||, p(H) . 


Using Proposition 3.2 we may replace the discrete norm ||a||^ ( 3 ) by its equivalent h 


obtain an L p version of Theorem |3 . 6 


Corollary 3.7. With the assumptions of Theorem\3.6\ we have 


|£feE°€X«| 


W ? (Q) 


<ch ' T l|Ej e H a «x«| 


Lp (fi) 


□ 

s\\ Lp , and so 

(3.21) 


with C = C(p, cr, m, p, U). 


Explicit dependence of C on p can be obtained from (3.12) and Lemma B. 6 . 
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4 Local Lagrange functions 


We now consider a new class of functions 6e £ 5(s), £ £ 2, constructed in a local and cost-effective way, 
employing only a small set of centers in S that are nea r E. For each £ £ S, this small set is called the local 
footprint of £ and denoted by T(£) C 5 (see Definition |d. l| . Each 6c is a Lagrange interpolant, centered at 
£, for points in T(£). The set T(£) is chosen to give 6c fast decay away from £, although not the exponential. 
The size of the footprint is controlled by a parameter K > 0. 

Unlike the full Lagrange functions, the local versions do not satisfy interpolatory conditions throughout 
5. There is no guarantee that they will have zeros outside of the set T(£) - as a result the operator 
T a = does not satisfy T a(£) = a^. It is only a quasi-interpolant (approximation on the sphere 

with this operator was considered in [8]). 

As in [5] the analysis of this new basis is considered in two steps. First, an intermediate basis function 
is constructed and studied: the truncated Lagrange function. These functions employ the same footprint 
as 6c (i.e., they are members of 5(T(£))) but their construction is global rather than local. This topic is 
considered in Section 4.2 Then, a comparison is made between the truncated Lagrange function and the 


local Lagrange function. The error between local and truncated Lagrange functions is controlled by the size 
of the coefficients in the representation of 6^ — using the standard (kernel) basis for 5(T(£)). This is 
considered in Section 14731 


4.1 Footprint and local Lagrange function 

Definition 4.1. For a compact set Q C_R_ d and a finite subset 5 C D, let S be the extension to {x £ | 

dist(x, 0) < diam(D)} given in Section 2.3 For a positive parameter K, define T(£) := {£ £ S | |£ — £| < 
Kh\logh\} for each £ £ 5. Then for the system of local Lagrange functions (6^)^ g s, where each b j is the 
Lagrange function centered at £, generated by k m over Y(£), let 


Vs := span{6 5 | £ £ E}. 

Note in particular that Vs C S( 5). Indeed, it is contained in a slight expansion of S( 5). Namely, 
U H C S(r), where Y := Uj eS T (0 C {£ £ S | dist(£,D) < Kh\ log h\}. 

The construction of each 6^ depends only on its nearby neighbors in T(£), so the majority of points in 
S are unnecessary from a computational point of view. However, the (analytic properties of) full Lagrange 
functions generated by k m over S will still be of use in proving theorems, so we will continue to refer to 
the extended set S, even though much of it plays no role in the construction of the functions 6j. 

In our main result, we make use of the following: 


Let (Xi)ies be the family of “full” Lagrange functions constructed by k n 
point set S. For any J > 0, the family (6^)j e =; satisfies 

IIXC - Mlw^(n) ^ hJ , for all £ £ S. 


using the extended 


(4.1) 


To obtain this result, we show that for a given J there is a K > 0, which governs the size of the footprint, 
ensuring that ||x$ — ^flloo = 0(h J ) holds. The value of K depends linearly on J, as well as some fixed 
constants involving m and d. 

In the following two sections, we show that this result holds for Matern (in Lemma 4.7) and surface spline 
radial basis functions (in Lemma 4.9). Specifically, this holds for any prescribed value of J, where J depends 
linearly on K, as given in Definition |4.1| 


4.2 Intermediate construction: Truncated Lagrange functions 

For a (full) Lagrange function xe, = Life! 0 +?€ 5(E) on the point set S, the truncated Lagrange 

function x^ := XxeT(£) Ai.cM’’ C) +P is a function in 5(T(£)) obtained by omitting the coefficients outside 
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of T(£) and slightly modifying the remaining coefficients = (A^). (For positive dehnite kernels, no 
modihcation is necessary, and the construction is quite simple.) 

The cost of truncating can be measured using the norm of the omitted coefficients (the tail). 

Lemma 4.2. Let K > (4m — 2 d)/v and for each £ £ S, let T(£) = {C £ 3 | |£ — £| < Kh\ logft|}. Then 


E 


|A S;C | < Cp 2m h Ky / 2+d - 2m 


C 6 S\T (0 


with C = C(m, d ). 

guarantees that E C£ h\t«) l A €,d < Cq d ~ 2m T,\t-c\>Kh\ iogh| ex P ) . 

By observing that for ( e S \ T(£), we have q d < CVol(.B(£, q) \ !?(£, Kh\ log/i|)) with a constant C that 
depends only on the spatial dimension d. (Note that for most £, the above set is simply B(£,q), while for 
those C which are near the boundary of £?(£, Kh\ log h\) the set contains a half-ball), we can control the 
above sum by an integral, namely 


Proof. The inequality (3.6 


E Eel 

Ces\T(|) 


< Cq d ~ 2m Y, expE/EE 

|C-C|>iCfc|logh| ' ' 

< Cq~ 2m Y f ex P (~ u ^ d V 

log h\ AeS(C,g)\B(J,ifh| loghQ V h J 

< Cq~ 2m [ exp dy 

Jyem d \B(£,Kh\logh\) \ n J 


(4.2) 


In the final inequality, we have used the fact that the sets B( £, q) \ B(£, Kh\ log h |) are disjoint and that for 
y £ B{(, q), dist(£, y) < dist(£, () + q < dist(£, Cf) + h, which implies — dist(£, £) < —dist(£, y) + h (leading to 
a small increase in the constant; a factor of e"). 

Applying a polar change of variables in the final integral gives the inequality 


E I A u \<Cq~ 2m 

Ces\T(0 


/ Kh\ log h\ 


exp 



r d ~ l dr. 


We simplify this estimate by splitting v = v /2 + v /2 and writing 


E i^«i 

(es\T(?) 


d„—2m 


< Chf q 

\JK | log h\ 

< Ch d q- 2m h Kvl2 = Cp 2m h Kv/2+d 


r d 1 exp {^—K\ log ft 1exp r^0 dr 


The lemma follows. 


□ 


4.2.1 Bounds for truncated functions: Matern functions 

Let || • \\z be a norm on 5(E) for which a universal constant T exists so that sup. gf2 \\k m (- — z)\\z < T. Since 
IIM- - z )\\z is Hnite and bounded independent of z, we have 

Hx«-xdU<r ^ \A u \<cr P 2rn h K ^ 2 - 2m+d . (4.3) 

C£H\T(0 

In particular, we have the following: 
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Lemma 4.3. Let m > d/2 and consider the Matern radial basis function k m = n m described in p?.5[ ). For 
1 < p < oo and a < 2m — d + | we have 

llxs - xdwzm < £ |^,d ll« n ,(-,C)ll^(R- ) < c P 2 m h K ^ 2+d ~ 2m 

Ce A'\T(j) 


with C = C{m , d). 

For p = oo, f/ie above result holds for the Holder space W^K**) replaced with C CT (R d ) 


Proof. We have from [15] [Lemma A.l] that n m £ Wif (R d ) for 1 < p < oo and r < 2m — d + d/p , while for 
p = oo, K m £ C T (M. d ) with r < 2m — d. In either case, the smoothness norm is translation invariant, so it 
follows that 

^Ollwp (M d ) — C T ,p and ■2')IIc x (R d ) — G r? oo 


hold. The result follows from (4.3). 


□ 


4.2.2 Bounds for truncated functions: Surface splines 

When k m = 4> m (i.e., a surface spline, and therefore conditionally positive definite), the argument of the 
previous section is a little more complicated. Given a Lagrange function = X<e x ^C,C&m( - > C) + P, simply 
truncating coefficients does not yield a function in 5(T(£)). That is, does not necessarily satisfy 

the side condition T(0 ,{P(C) = 0 for a U P £ 

The result for restricted surface splines on even dimensional spheres (§ 2n ) has been developed in jSJ 
Proposition 6.1]. We now present a similar estimate for surface splines on R d where the truncated Lagrange 
function is corrected by perturbing its coefficients slightly. This is done by using the orthogonal projector 
having range _L (II m _i |r(£))- Keeping this perturbation small is essential to our later results, so we must 
estimate it. We use the following result about Gram matrices for polynomials sampled on finite point sets. 


Gram matrices for polynomials restricted to point sets Let N = dimn,^! and consider A' C R d 
a finite point set. For a basis {pi,... ,pn} of II m _i, denote by 4>_y the (Vandermonde-type) matrix with N 
columns and f/X rows whose j th column is pj restricted to X. In other words, 

£ M (#x)xAr (R) with {®x)u=Pj( l 0- ( 4 - 4 ) 

Lemma 4.4. For every m £ N, and any radius r > 0, point x £ R d and point set X C B(x,r) with fill 
distance h < h^r, where ho = ho(m), the inverse of the Gram matrix Gjc = 4>y4>y £ Mjvxiv(R) has norm 
bounded by 

||G ^|| 2 —>2 <Cr~ 2 ^ 

with C = C{m , d). 

Proof. From (231 Theorem 3.8 and Corollary 3.11], we have that if X C B(x,r) has fill distance h < hor, 
then A is a norming set for B(x,r) with norming constant 2. (Here ho = c m - 1 , from [33l Corollary 3.11].) 
This means that for every p £ n m _i, ||p||z, 00 (B( a; , r )) < 2||p|x llt^x)- 
The norm of the Gram matrix can be controlled by 

||G_y 1 || 2 ^2 = ( min (G.va.a ))- 1 and (G x a,a) = ||$xa||| (x) = \\RxVa\\ 2 e {x) 

where V& := X£i a jPj an d Rx is the restriction operator RxV a = a jPj \x ■ For h sufficiently small, 

the norming set property ensures that 

lblU oa (B(s,r)) < 2 ||I?.yP||^ 00 (X) < 2||FyP||<? 2 (X)- 
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On the other hand, we have the following growth properties of polynomials there exists a constant 

C m > 0 so that for every 0 < r < 1, ||p||i 00 (B( x ,i)) < \\p\\L ao (B(x,r))- Returning to the basis 

(pi, ■ ■ ■ ,Pn ), we have 

N /1\ m— 1 N 

||a||^ 2 (JV) < C'm.dll < C m ,d i-J II y^ J 0'jPj\\L ao (B(x,r))- 

j =1 ' ' j =1 

This gives ||a||^ 2 (jv) < Cr _ ( m_1 )|| a jPj \x IU 2 (.xjj and the result follows. □ 

A bound similar to this for using spaces of spherical harmonics in place of n m _! has been demon¬ 
strated in 0 Lemma 6.4], while [15] gives general conditions for the auxiliary space of a CPD kernel. 


Modifying coefficients We use the matrix $ T (£) to construct P = 4>x(£)( < Fx(£) < ^T(j))” 1 d > Y(£)’ or ~ 
thogonal projector which has range II |r({) and kernel _L (II |-r(^))- For a fixed £, denote the truncated 
coefficients (A^j^gx^) € R T ^ by At. In order to satisfy the side conditions, we generate the modified 
coefficients = (A^) e R x ^ via 

Aj = At - PA t . 

In other words, A^ is the orthogonal projection of At onto _L (II |t (g))- Define the ‘truncated’ Lagrange 
function as 

XZ : = E A^rni-~ () + P- 

CeT(£) 


Lemma 4.5. Let m > d/2 and consider the surface spline radial basis function k m = (f> m described in (2.6). 
For sufficiently small h we have 


|| A - A|U 2(t(€)) < C p 2m h K ^ 2+1 - 3m+d | log^l 1 -” 1 

with C = C(m, d). 

Proof. We estimate the (.2 norm of the difference of the coefficients as 

ll A « “ A dk(T(£)) = ||PA ? ||^ 2 ( T( j)) = (^?(j)A ? ,GYj|)$T(5)Ae) 1/2 < 112^2IIIU 2 (at)- 


(4.5) 


Since X) Ce g A Ci€ p(£) = 0 for all p G II, we have $x(4) A ? = -(E f e§\ r(Z) A C,ZPj(0) j=v 
Applying the estimate (2.2) the ^(N) norm of I * 2 A is controlled by 


N 


d > T(£) A dlA(iV) ^ II^T(£) A dlA(JV) < E 

3=1 


E A czPj(0 

CeH\T(e) 


N 


< E i A c,dE i^(oi- 

C6H\T(£) 3=1 


In the first estimate we use the inequality \ c j\ 2 < (E Iffil) 2 - Applying Holder’s inequality and (3.6) to the 
right hand side gives 


$ x(f) A dl e 2 (N)< N E ( j “ ax Jv l Pi (OI) |Ac,d < Cq d 2m E (™ x N \Pj(0\)exP ( -v 


dist(C, C) 


j=l-N 


Ces\T({) fes\T(s) 

where we have absorbed N (recall that N = dimn m _i depends on m and d) into the constant C. 
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We now recall the argument in (4.2) which allows us to estimate the above sum by an integral: 


1 $ 


T(£) 


Afll^JV) < Cq 2m ( 
J Is 

< Cq~ 2m 


max 

Kh\ogh3 =1 --- N 


Pj ' [ L ao (Bte,zj) 
max(l, z m ~ 1 )e~ uz ^ h dz 


z / h dz 


J Kh log h 

^ (j p2m j^Kv/2-\-d— 2m 


(4.6) 


In (4.6) we have used a change to polar coordinates, as in Lemma |4.2| 


Estimate (4.5) follows by combining Lemma 4.4 (using r = Kh\ log/i|) with (4.6). 


□ 


As in the positive definite case, we are able to control the truncation error measured in suitable smoothness 
norms - the only requirement is that the kernel is bounded. In the conditionally positive definite case, the 
kernel may be unbounded, so we measure the norm over the bounded region Cl. Specifically, the surface 
spline f> m £ W^ loc (K d ) for all a < 2m — d + | (as well as C 1 Q C (M d ) for a < 2m — d). There is T < oo 

(depending on cr, p, m and Cl) so that for £ £ Cl, ||</> m (- — C)llw^(n) < r. 


Lemma 4.6. for 1 < p < oo and cr < 2m — d + ^ 


||X€ - Xdknn) < Cp 2m+d / 2 h Kv / 2+1 - 3rn+d \\ogh\ d / 2+1 ~ 


with C = C(cr, to, p, Cl). 

A similar result holds for p = oo, replacing Wff (f2) by C a (Cl) for a < 2to — d. 

Proof. The Sobolev estimate holds by considering 

llxj - Xfllw^(n) < X! I^.C _ A 5 iC ||| 0 m (- - C)llw-(n) + ^ |A^^|||0 m (-- C)llw^(n)- 

CeT(«) C£T({) 


The first term can be bounded by introducing the constant T := max^ g j=j \\4>m(- — C)llw^(n) < oo, which 
gives EceT({) \ A U ~ A Z,c\ Il0m(- - C)llw-(n) < r||A s - Adk(T(£))- Employing (4.5) yields 


y. |A^,c - A €i ^|||0 m (- 

C£T(S) 


C)lk-(O) < CTp 2m h Ku ^ 2+1 ~ 3rn+d \ log /i| 1 _m (#T (^)) d/2 
< CTp 2m+d/2 h I ^ +1 - 3rn+d \ log h\ x ~ m+d/2 . 


For the second inequality we have used the estimate #T(£) < Cp d \ \ogh\ d 

The second term is bounded by Ec^t(£) IA|,d \\4>m(- — C)llw^(n) < rE c^t({) l^s.d w hich can be further 
treated with Lemma 4.2 <-ih+ain 14, <r nrn 2m hhA- 2m + d 


to obtain Efgr(f) l^clll&nG “ Ollw-(n) < CYp 2m hr 


O 


4.3 Local Lagrange Functions 

In this section we consider a locally constructed function b £. Our main goal is to show that for 5 C Cl, there 
exist functions b $ defined on K d , so that || E^gh a ckllw^(fi) < C'/iJ’ _cr |l a lk( H )‘ 

At this point, a standard argument bounds the error between b^ and (this argument is essentially the 
same one used on the sphere in 0 ). This works by measuring the size of b^ — xe, £ S(T(£)). 
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4.3.1 Bounds for local Lagrange functions: Matern functions 


For the positive definite case, the argument is fairly elementary. For ( £ T(£), let y^ := b^(() — Xz(0- 
Observe that = X<eT(£) a C k m{‘ — C) £ S'(T(^)), where a = (a^) and y = (y^) are related by 

Kx(f)a = y. The matrix (K X (f)) -1 has entries ( A i;,ri)<;,ri£r($,)- 

For a kernel of order m, the entries of the matrix A = (^c,’?)c,^eT({) can be estimated by (3.3): |A^ jr; | < 
Cqd- 2 m p. f 0 n 0W s that (K X (j)) _1 has matrix norm 


( k T(o) 


< C(#T^))q d - 2m < Cp 2m \ log h\ d h' 


d d—2m 


(Here we have used the estimate #T(£) < Cp d \ log h\ d .) Consequently ||y||i < (#T(^))||y|| 00 . 
Because y f = y c (C) - x«(C) for C € T (£) and ||x« - xdl°° < C'llxc - xdlwTCR-q we have 


Y l a cl < ( k t(|)) 

CeT(|) 


-1 


l-)-l 


U < Cp 2m+d h d ~ 2m \logh\ 2d \\ Xi - xdk 2 -(RO- 


For a generic norm || • \\z for which max. £ ^ ||fc m (- — z)\\z < T we have \\b^ — Xz\\z < T|a^|. We now 
have the counterpart to Lemma 4.3 which shows that (4.1) holds for Matern kernels. 


Lemma 4.7. For k m = K m , and for 1 < p < oo and a < 2m — d + d/p we have 

ll&£ -xdl^(*0 < Cp Am+d h K ^ 2+2d - Am \\ogh\ 2d 


(4.7) 


with C = C(m, d). 

Setting | log h \ 2d < Ch -1 (either by finding a sufficiently small h* so that this holds for h < h*, or by 
increasing the constant, or both), and by employing a simple interpolation inequality, we have 

II k - xdl w«m ^ Cp Am+d h J , J = Kv/2 + 2d - Am - 1. (4.8) 


4.3.2 Bounds for local Lagrange functions: Surface splines 

As in the previous section, we are guided by the estimates for local Lagrange functions on the sphere J3] 
Proposition 5.2]. 

In this case we have — b^ = J2ce T(£) a (4 > m(- — 0 + Ej=i c jPj £ The vectors a = (ac)cex(£) 

and c = (cj)j =1 ... N are related to y = (yc)ceT(j) = (Xc - &c)eex(£) by 





where K X (^) is the collocation matrix and $ is the Vandermonde matrix introduced in (4.4). The norms 
of a and c can be controlled by ||y||^ 2 (T(£))- This is demonstrated in j8( Proposition 5.2], which shows 
that ||a||^ 2 ( X (£)) < d _1 ||y||^ 2 ( X (j)) where d is the minimal positive eigenvalue of P ± K r fnP ± ■ Recall that 
P 1 - = Id — P and P = 4>(4> T $) _1 4> 2 is the projector introduced in Section 
We make the following observation, which is [5] Proposition 5.2]: 


4.2.2 


ll a lk(X(£)) < ^ 1 ||y|U 2 (-r(e)) and ( 4 - 9 ) 

ll c IU 2 (AT) < 2 max \(/> m {v - C)ll|G x L|| 1/2 ^ _1 (#T(0)||y||^ 2(X(4)) (4.10) 

V: CeT(^) 


It is possible to estimate the size of $ by considering the matrix of kernel coefficients for the Lagrange 
functions b VyX ( 4) = E<eT({) A C,v k m(-,0 + Hf=i Bpv&r 
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Lemma 4.8. For ft, the minimal positive eigenvalue of P- L K T (^)P ± , we have ft 1 = ||.4||2->-2> where A = 
(A^ >ri )^ i??g x(£) the matrix of kernel coefficients for the Lagrange functions in S( T(£)). 

Proof. Writing B = ( Bj v )j=l. n it follows that Kx(f)A + = Id. From this we have P 1 - = P^Kx^A 

v£T(t) 

and ker A C ker PU On the other hand, each column of A satisfies the side condition J2 v eT(n = 0 

for all p £ II, so ranA C ranP- 1 . From this it follows that ker A = ker P^ and ranA = ran P 2 - . 

Because P^A = A we have P ± = P^Kx^jA = P^KxmP^A, and the nonzero spectrum of A is the 
reciprocal of the nonzero spectrum of P ± K.y(£)P ± ■ In other words, $ _1 = max^g CT (^) |A|. □ 


Applying Gershgorin’s theorem to A, whose entries are A^ v = (&£,x(£)i and therefore satisfy 

\A c ,n\ < Cq d ~ 2m , we have ft- 1 < C( 1 + #(T(£)))< 7 d_2m - By Q we have 


a|U 2( x (e) ) < Cp 2rn h d - 2m \ log/i| d ||y||^ 2 ( x(|))- 


(4.11) 


Using Lemma 4.4 we have that ||G X ^)|| 1 ^ 2 < C m ffiKh\ log h\) while $ 1 < C myd p 2m h d 2m \logh\ c 


and (#T(0) < C m ,dp 2m \logh\ d . Applying (jLlOj) gives 

l|c|U 2 (iv) < C m ffi2Kh\ log h\) 2m ~ d [{Kh\ log h\) 


— (m —1) 

< C m , d p 2m+d h-( m ~V I log /i| m+ 1 +d ||y||, 2 (x( 0 ) 


(p 2m h L 


— 2 TO I 


log nn (p d \io S hn 


(4.12) 


We are now in a position to prove that (4.1) holds for surface splines. 


Lemma 4.9. Let k m = (f> m , the surface spline RBF on and let J > 0. For ScH, form the local Lagrange 
functions b% G T(£), with Y(£) = Sfl P(£, Kh\ log h\), where J = — 5 m + d + 1. Then for 1 < p < oo 

and a < 2m — d + and for sufficiently small h, 


\\h-xdw ?i n)<Cp 4m+2d h J 


with C = C(a,m,p,Q). 

Proof. We use the triangle inequality ||- xdlw/(fi) < \\h - xdlw-(n) + ||X£ - Xfllw-(n), noting that the 
second term has been estimated in Lemma |4.5| and that the first can be controlled as 


\\h ~ Xdlw-(fi) < ll a lk(T(j)) m a* II ~ ^)llw^(n) + ll c IUi(iv) max II(•)II- 

From (All) we have ||a||^ l(T ( 4 )) < \/#T(OI|a|k 2 (X(|)) and #T(£) < C mjd p d \ logh\ d , so 

l|a||/ l( x(0) < Gp 2 m+d/, 2 /i d_2m | log fo| 3d / 2 ||y|U oo( x($)) 

< C p im + d h Kv / 2 - 5rn+2d+1 \ log/i| 2d_(m_1) 

where we have employed the result of Lemma |4. Gland the embedding W™ C Poo to estimate ||y| 

||6f - xdU.(B(£,*fcUs£iJ)) ^ C d>m p 2m + d / 2 h K ^- 3m + d I logPI 1 -^/ 2 . 

Similarly, from (4.12), we have 


l^(T(?)) ^ 


||c||* l(JV ) < C' / 9 2 m +d+d/ 2 /l -(m-l)| log ^| m +l+d+d/ 2 | 

/~i 4m-\-2d fKv/2— 4m+2+d| 


ylUoo(r«)) 


i •. 7 19-L9/-7 


Because max zg x({) \\(t>m(-^z)\\w c ’ (n) and maxi<j<jv ||0j(‘)llw‘ , (r2) are bounded by a constant T which depends 
only on ft, m, p and a, we have 

\\h - xdlw-(n) < TCp 4m+2d h Kv/2 - bm+1+d \ \ogh\ 2+2d . 

The lemma follows for h sufficiently small that | \ogh\ 2d+2 <h~ x . O 
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4.4 Bernstein type estimate for local Lagrange functions 


In this section we discuss the local Lagra nge (b g) functions generated by k m and the centers S. We develop 
partial Bernstein inequalities similar to (3.12), where for functions E^e" 0 ^ £ Vs, smoothness norms 
11 s| I Wp are controlled by an t p norm on the coefficients: ||a||^( S ). 

Theorem 4.10. Consider the family of local Lagrange functions generated with K > 10m ~ 2 . p or 0 < cr < 
m — {d/2 — d/p) + when 1 < p < oo (or cr £ N and 0 < a < m — d/2 when p = oo), we have 


Ces 

where C = C(a,p,m,fl). 

Proof. We start with the basic splitting 


W5(Q) 


< Cp 4m+3d h d /P-°\\a.\\e AE 


(4.13) 


: = E = ( E 

jes Ees 


=-g + b. 


Applying the Sobolev norm gives ||s||w^(f 2 ) < IIGHw^qn) + l|F?||w p (n)- From (3.12), we have 
||G||^ ( o) < Cp m+d ' 2+d 'Ph d 'v-° ||a|U, (a) < Cp Arn+M h d / p ~° ||a|| MH ). 

Taking the L p norm of B, we have || Efes a ?( 6 « “ X«)l|w-(n) < max| eH ||&£ - X||| w?(f2) EjeH l°d- We 
control the l\ norm by using Holder’s inequality ||a ||i < (#2) Pp ||a|| p and < Cnp d h d . Using Lemma 


4.9 (or Lemma |4.7| in case k m = K m ), we arrive at the desired inequality 

II Y, a dh - Xe)llw-(n) < Cp 4m+3d h J ~ d ^ ||a|| MS) < C p im+3d h d 'r-"\ |a|| MH) (4.14) 

because the choice of K ensures J > d — a. The theorem follows. □ 

For s £ Vs = span^ gH b$ we may replace the discrete norm ||a||^ p ( S ) by its equivalent /i _d / p ||s||Lp, as we 
now show. 

Proposition 4.11. (Local Basis Stability So Nikolskii Inequality) For every po > 1 there exists a 
constant h 0 > 0 so that i/ScO has fill distance /i(S, 12 ) < ho and mesh ratio p < po, then the family of 
local Lagrange functions generated with K > 10m ~ 2 satisfies the bounds 


c a 


p( E) < q~ d/p M Lpln) < Cp m+d ' p \ 


.(H) 


for all s = Efes £ Fe> with c = c{p, m, 12) and C = G(12, m). In addition, for 1 < p, r < oo, 


l s IU P (n) < Gg 




S ll-Mfi) 


(4.15) 


have 


(4.16) 


with C = C(p, r, p, m,Ll,). 


Proof. The upper bound follows from the previous theorem, with cr = 0. To obtain the lower bound, note 
that g _d/p ||s|| Lp(n) = q~ d /p\\ E^s a dh ~ Xi) + E^e a SX'dU P (n)- Consequently, by (JUj), (^ and d4.!4| ), 
we have 


9 d/p ||s||L p (n) 


> Q~ d/P (llE^Xflkcn) - Gp 4 -+ 3 d / l J -< E i i )||a||, p(H) ) > ( Cl - Cp 4m+3d h J ~^ 


k P (H) 


where C\ = Ci(p, 12, m) is the constant from Proposition 3.2 Let ho > 0 be such that Ci—Gp 4m+3d /iQ d > \c-\. 
This guarantees the same holds for all 0 < h < ho- The proof of the Nikolskii inequality is, mutatis mutandis, 
that of Corollary |3.3| □ 
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5 Main results and corollaries 


At this point we can prove the inverse inequality for local Lagrange functions in Vs' 

Theorem 5.1. Suppose 12 C is a bounded Lipschitz region. For m > d/2 and for every po > 0 there 
exists a constant ho > 0, so that if 2 C 12 has mesh ratio p < po, fill distance h < ho, and if 2 C 12 is a 
suitable extension o/S (as mentioned in Remark 2.3) then for all s = £ Vh the following holds. 

For 1 < p < oo and all 0 < a < m — ( d/2 — d/p)+, or for p = oo and an integer a < m — d/2, we have 

INIw^qQ) — ll s ll-Lp(n) 

with C = C(m,p,Q.). 

Proof. This is an immediate combination of Theorem |4.10| and Proposition |4.11[ □ 


5.1 Restriction to the boundary 


Immediate applications of Theorem 5.1 are the following “trace” estimates. (Since the elements of Vs are 


continuous, it is appropriate to consider these results about restriction to the boundary.) To make sense of 
these, we first need to describe Sobolev spaces on the boundary <912. 


5.1.1 Smoothness spaces on <912 


We use the common tactic of employing a partition of unity with corresponding changes of variable to flatten 
the boundary. (As in [31] 1.11] and [21], for instance.) The details of the partition of unity and change of 
variable depends on the smoothness of the boundary, and this influences the types of Sobolev spaces we can 
define (namely, the maximum order of smoothness is governed by the smoothness of the boundary). 

For a domain whose boundary is Lipschitz we consider a partition of unity (i(j)jL l of <912, where each 
npj : <912 —> [0,1] is Lipschitz, and let ( Uj,hj)jL 1 be a corresponding collection of bilipschitz charts so that 
each Uj is an open set in <912 containing the closure of supp (ipj) and each hj : Uj —> Oj C M d_1 is a bijective 
Lipschitz function. Then for 1 < p < oo and 0 < a < 1, the Sobolev space W p (912) consists of functions 
/ £ L p ( <912) such that 

MKr-fon) := EH(^ 0 ^7 1 ))(/°( /l 7 1 ))ll w ? (o 3 ) (5- 1 ) 


is finite. 

For higher orders of smoothness, we simply increase the smoothness of the boundary, and the partition 
of unity and chart. For a < M, let (i/jj)R =1 be a C M partition of unity of 912, and let {Uj,hj)^ =1 be a 


collection C RI charts. Then 1F^(912) consists of functions for which the norm (5.1l is finite. 

We note that this transporting of norms from Euclidean space to manifold by way of partition of unity 
and pull-back can be carried out for other smoothness spaces. In particular, it holds as well for the Besov 
class (see again [31] and [21]). For this reason, it follows that for fractional a, (912) = Bf p {dVL) with 
equivalence of norms (as in the Euclidean case). 


5.2 Trace estimates 


We may use Theorem 5.1 to obtain the following trace estimate for functions in Vs- This is non-standard 


because the norms of the trace are bounded by Lp norms rather than Sobolev norms. 

for s £ Vs we have, for 1 < p < oo and 0 < a < 


5.1 


Corollary 5.2. Under hypotheses of Theorem 
m-l/p- (d/2-d/p)+, 

IMIw^an) < CA -<T-1/p || 1 s|U p (n) 

with C = C(m, p, 12). 
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Proof. For a > 0 we have that Wp +l ^ p (12) = -Bp,p ly,p (f2) and W p (<912) = Bp p (dfl). It follows that Tr : 
Wp +1 ^ F (12) —» W p (dLl) is bounded by the trace theorem (one will find a suitable one for smooth boundaries 


in GEE 3.3.3], and for Lipschitz boundaries in |211 Theorem 2.1]) so 


Mlwwan) < Co||s| 


w, 


cr + l/p 


m 


< Ca P h-°- 1/P \\s\ 


L p (£l) 


□ 


The first inequality is from the trace theorem, while the second follows from Lemma |B.6[ 

We can get a similar estimate for a = 0, although this requires a modified trace result. 

Lemma 5.3. Suppose 12 is compact with C 1 boundary. For 1 < p < oo there is a constant C p so that for 
all u £ C 1 (f2) and e > 0 we have 


MImso) 7 c v ^~ q/v IM 


Lp (fi) 




WFSl) 


with q = . 

Proof. Note that in this case, we consider Sobolev norms over 12, so for the norms on the right hand side, 
we make use of the definition given in Section 2.4 The L p (<912) norm on the left is with respect to surface 
measure, but this can be estimated in a standard way (by partition of unity and change of variables). 

We begin by proving a trace result for 12 = R+ = R d_1 x [0, oo). For u £ C 1 (R^.) having compact support 
and x' £ R d_1 , let r x > be the first positive zero of £ i—>■ \u(x',t)\. Then 

\u(x',0)\ p < [ d d \u(x',x d )\ p dx d 

Jo 

/»oo 

<p |k.(cc', aid) | p—1 \d d (u(x',x d ))\dx d 


< 


C(e)\u(x',x d )\ (p 1)p - 1 + e\d d {u(x' ,x d ))\ p dx d 


The last line uses Young’s inequality ab < C(e)a q + eb p with C(e) = q 1 {ep) q ^ p ■ Integrating this over ' 
we have 


d-1) ^ Cp€ 


~ q/p \\i 


+ e w 




(5.2) 


Now let (’F/)!) =1 be a finite collection of non-negative, compactly supported, C 1 functions so that ^ = 

1 in a small neighborhood of <912. Let ( Uj)j I =1 denote a corresponding collection of open sets so that 
supp(4/)j C Uj and so that there is hj : Uj —?■ B(0,ej), an open ball in R d . 

For / £ C ,1 (12) and 1 < j < N, the pro duct 'bjf is compactly supported and (extending by 0) we have 


“ (*i/) ° (hj 1 ) e C, 


+)• Applying d5.2|> to (Vjf) o h j 1 gives 


Lp 


d- 1) ^ Cp€ 


- q /p 111 


"P 'Lp 




. Because 'Fj and h- 1 are C 1 over compact sets, their norms can be bounded independent of j. 


By applying chain and product rules, it follows that 


N 

£iw p 

9=1 


*3 M L„ 




+ e u 


Iw-i(n)) 


(5.3) 


with an increased constant which depends on that of (5.2) as well as maxi<,-<jv II (hj) 1 |l / _ 1/ . .. \ 

— J — J Ciyhj (supp(^j))-J 

and max-] < 7 <v II IICi (M d ) - Because ^j| en ) is a partition of unity for <912, the left hand side of (5.3) controls 
the L p norm of tt| an , which gives the e-modified trace inequality 


ll P p (an) 


7 C'p,q( 


II £n) 


II WPS!) 


)■ 


□ 
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Corollary 5.4. Let LI be a bounded domain with C 1 boundary and assume the hypotheses of Theorem 5.1 
For s £ Vs we have, for 1 < p < oo and 1 + ( d/2 — d/p)+ < m that 


l s IU p (dfi) < Ch 1/p ||s|| Lp (n) 


with C = C(p, p,m,Cl). 

Proof. For p = 1 the Theorem follows directly from the boundedness of trace from W((Cl) to Li(dCl) (see 


[3 Theorem 1 Chapter 5.5]) and by repeating the argument of Theorem |B.6| 


For 1 < p < oo, we apply Lemma 5.3 with e = h p 1 (so that e q ! p = h p- 1 ) followed by Theorem 
Thus, 


B.6 


\L p (dQ) 


< C{h~ l \\s 


-1IUIIP J./jP-lllollP 


M fi) 
lip(n) 




< C(h~ 1 ||s||? + 


(f2) 


and the result follows by taking the pth root. 


□ 


A Energy and pointwise bounds on the Lagrange function 

In this section, we show that Lagrange functions for surface splines and Matern kernels satisfy decay estimates 
as in Section ETT1 

We say that Cl satisfies an interior cone condition if there are constants p £ (0,7t/ 2) and 0 < R < oo 
so that for every x £ SI there is a cone = {y \ \y — x\ < R, n ■ (wEfy) ^ cos^j} opening in the direction 
determined by the unit vector n so that Cn C Cl. 

We recall the zeros estimate ms Theorem A.11] for a bounded region Cl with Lipschitz boundary (the 
version we cite is a streamlined modification of an earlier estimate given in [261 Theorem 2.12]). 

Lemma A.l (Zeros estimate). Let 1 < p < oo and m > d/p (when p = 1 we may have m > d/p). 
Suppose Cl satisfies a cone condition with aperture p and radius R. Then there are constants hi (depending 
on m and p) and A (depending on m,d,p,p) so that if X C Cl has fill distance h < h\R and if u £ Wff'iCl) 
satisfies u \x = 0 then 

IMIw^Cfi) < Ah m fe |M|w>"(fi) 

and 

IMUoc(fi) < lCh m ~ d/p \\u\\ W nv(Q.)- 

An important feature of this lemma is that the density h is controlled by the cone radius R, but the 
constant A does not depend on R. This allows a comparison of results across sets which are geometrically 
related. E.g., annuli B(x,r 2 ) \ B(x,r\) satisfy cone conditions with aperture p independent of r 2 and r i, 
and with cone radius equal to half the thickness 7-2 T=* . Thus, the above result holds for any point set with 
h < h\(r 2 — r\)/2. With almost no modification, this result extends to balls B(x,r ) (where h < h\r) and 
complements of balls (where there is not restriction on h). 

Consider now the annulus a(£,r, t) := {x £ | r — t < |:r — £| < r}. Applying Lemma A.l with p = 2 

and k = m — 1, we estimate the Sobolev nornj®|as |M|pym (a) < + mA 2 h 2 \\u\\‘^ vm ^ 1 ^ which, after 

rearranging terms, implies that IMIvvm( a ) ^ i-mA 2 h 2 \ u \w m (a.) ^ or u vanishing on X C a with h < h\t/2. In 
short, if h < min(^, / 12 ) with 

/12 := ( V2mA)~ 1 (A.l) 


by 


5 Recall that we use the Sobolev norm as defined in Section |2.4| - in particular, the kth order partial derivatives are weighted 
m\ 

k 
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then 


for u vanishing on X. 


Mw 2 fe (a) < IMIwJ(a) < Ah m k \\u\\ w ^( a ) < 2Ah m k \u\ W £ 


(A.2) 


Lemma A.2. Suppose m > d/2. There is a constant v = v{in.,d) with v < 1 such that if X C is a finite 
point set, a = a(£, r, t) is the annulus of outer radius r, width t and center £ G X, and Xq = Xfla has fill 
distance h = h(X o,a) < min(-A^,/i 2 ) ? then 

• the Matern Lagrange function X{ G span{K m (- — £) | f G A} satisfies 

^^W,J' l (R< J \.B(f,r)) — ^W 2 m (m d \B(Z,r-i)) ' 

• the Lagrange function X£ € S(4> m ,X) for the order m surface spline satisfies 

l^lw 2 m (R d \B(?,r)) — ^^W 2 m (R' i \.B(£,r-t)) ' 

Proof. In either case, the function k m is the reproducing kernel for a (semi-)Hilbert space (described in 


Sections 


2.5.1 


and 


2.5.2), and we use the notation [it] TO ,y to denote ||u||w 2 m (y) or |u|w 2 m (y)i respectively. 

[0,1J be a smooth cut-off function supported on the interval (—oo, 1) equaling 1 on (—oo, 0]. 

as = r(|(|x — £| — ( r ~ t))> and note that it is a smooth 


Let r : 1 

For r > t, we define : 

function supported in the ball _B(£,r), and equals 1 in B(fi,r — t). By the chain rule, there is a bound 
\\D^T^ t r t t ||oo < Ct~\P\ which is independent of r. 

Both Xi and T^ rt x^ are Lagrange functions on X. Thus [xdm < Using the additivity of [-] m , 

and noting that the Lagrange functions are identical on B[fi,r — t) while vanishes outside B(f,r), we 
have 

[xdm — [ T e,,r.tXe\m [xdm,R d \B(f,r-1) — [ T £ ,r,iXdm,a(£,r,t) * 

By using Holder’s inequality in conjunction with the product rule, we have 


j \ D °‘{n,r,t(x)xd x ))\' 2dx = / I E (jg) ^ ^ T ix,t{x)D^xd x )\ 2dx 

< C^t- 2 ' 01 -® f \D^xdx))\ 2 dx 

/3<a , ' a 

< cJ 2 h ~ 2(H ~ m) [ |U> /3 Xj(^))| 2 dx 

J a 


(A.3) 


/3<a. 


In the last line we have used that /? < a, and thus t l a ^ = t ^/i Applying (A.2) 

to (A.3) gives, for each (3 < a, J a \D^x^( x ))\ 2 dx < C 2 /i 2 ( m_ l^l)||xdlw| n (a)- This yields the inequality 


far.tWfinVM.r-t) < E E (?) ) CV^Ed 

| a\=m ( 3 <ot ^ 

Canceling powers of h and collecting constants which depend only on m and d , we have 

[xd m, R d \B(£,r-t) — l T €,r,tXd m, R d \B(£,r-t) — c\xt] m, a 
Finally, we note that [xdm,a = lxdL,Ri\B K , r -i) - [xd™,R d \B(£,U which y ields 

C- 1 


HTW' 


[xd 


< 


m,R d \B(£,r) — 


[xdm,R rf \B(£,r— t) 


and the lemma follows with v = 


C -1 


< 1. 


□ 
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We may now iterate Lemma A.2 to get control of the “energy” of the tail of the Lagrange functions. 


Lemma A. 3. Suppose D C is bounded, and X C D is a finite point set with fill distance satisfying 
h(X, D) < /12. There is p = /i(m , d) > 0 so that for R < dist(£, dD) 

• the Matern Lagrange function X£ £ span{K m (- — £) | ( £ A} satisfies 

• the Lagrange function X£ G S(4> m ,X) for the order m surface spline satisfies 

R 


I n\wr(R*\Bu;, R )) ^ c <i d/2 ~ m ^ ( -T 


Proof. Setting t = Ah/hi (where hi is the constant appearing in Lemma A.l), consider, for r < dist(£, <9.D), 


an annulus a(£, r, t) and the restricted point set X 0 = Xna(£, r, t). The slightly smaller, inner annulus a(£, r— 
h,t — 2h) has the property that for every x £ a(£, r — h,t — 2h), there is ( £ X 0 so that dist(x, () < h (since in 
that case dist(x, Xq) = dist(x, A)). It follows that /i(Xo,a(£,r, t)) < 2 h and therefore /i(X 0 , a(£, r, 2)) < hxL. 


Now letting n = [R/t\ , by Lemma A.2 we have that 

~ — v l^lw 2 m (R d ) — v v lh ^w.p»(R d )‘ 

< Cq d / 2 ~ m , so the lemma follows with p = 

□ 


By the “bump estimate” (3.2), we have that |xd 


— log(r / ), which is positive since v < 1. 


liv 2 m (R rf ) 


Note that if 12 C is compact, then 12 = {x £ | dist(x, 12) < diam(12)} automatically satisfies a cone 


condition (with radius R = diam(12) and aperture independent of 12). Thus, the result (3.4) follows with 
D = 12, A = S, h 0 = h 0 (d , m) and (£S. 

Because x £ 12, £ £ S implies R = |£ — x\ < dist(£, <912), we can apply the second part of the zeros 


estimate Lemma A.l to Lemma A.3 to obtain the pointwise estimate (3.5) 


dist(x,£) 


(A.4) 


IX|(z)l < Ch m d/2 ||xdlw 2 "(R<i\B(j,dist({,a ; ))) < Cp m d/2 exp \^-p h 
Note that in the second inequality we have written j l m - d / 2 q d / 2 ~ m — pm-d /2 

B Stability bounds for the Lagrange function 

We now demonstrate that the family of Lagrange functions for suitable kernels over a domain 12 satisfy 


stability bounds of the form (3.7). This was demonstrated in OS Proposition 3.6 & Theorem 3.7]; we follow 
the argument presented there, with modifications for dealing with a suitably bounded Euclidean domain, 
and to obtain a necessary refinement: that the threshold fill distance ho depends only on m and d (and not 
on p or 12). 

Lemma B.l. Suppose 12 is a bounded domain and S C 12 is a finite subset with fill distance h < h- 2 , where 


h 2 = h 2 {m,d) is the constant given in (A.l). There exists a constant C 2 = C 2 (m,d) so that the family of 


functions (X{)fes have the property that for s = a £X|> 


< c 2 p 


m-\-d/2 d/p\\ 




holds. 
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Proof. For p = oo, inequality (A.4) leads to a bound on the Lebesgue constant £ for the X{’s over Cl: 


£ := sup (E^hIxsWI) < C P m+d/ \ C = C(m,d). 




(B.l) 


Indeed, for fixed itllwe note that lx?( ;r )l < Cp m d ^ 2 Efes e M h " ■ By estimating en annuli, 

using sets A n := {£ € S | h(n — 1) < \x — £| < hn} having #A„ < C , we have that 


lx«(*)l < Cp m - d/2 (l + 55 P d n d e~^ n ) < Cp m+d/2 , 

£GE n=l 


where the constant C = C(m,d ) is independent of 5. Taking the supremum then yields (B.l I. It follows 
that 

IMUoo(fl) < ^INhH^e) = 'C||a||^ 00 (E), 

For p = 1, we have 

/ |s(a:)|dx < 55 l«d / |x*(®)|da; < Ch d ||a||* l(s) . 

Jn Jn 


Here we have used the fact that 11XA111 5= Cp m d / 2 h d < C p m+d ^ 2 q d , which follows by integrating (A.4). A 
standard application of operator interpolation proves the other cases. □ 


Preliminary estimates 

Because Cl satisfies a cone condition with aperture p and radius R, there is a constant a (depending only on 
d and p) so that for all x £ Cl, 

ar d < vol(B(x, r) n Cl) 

for r < R (the radius of the cone condition). Similarly, we have that there is a constant K (depending only 
on d and p) so that 

#(SnB(i,r)) <K{r/q) d . (B.2) 

We can use a simple modification of the zeros lemma H3 Lemma 7.1] valid for balls, which states that 
there exists a constant /13 > 0 depending only on m and d so that for h < ho, the Holder-like condition 

\xdx)-xdy)\<Cp m ~ d/2 (^^y 

holds for 0 < e < m — d/2 and with a constant C = C(d, m). 

Remark B.2. For the remainder of this appendix, we assume 5 is sufficiently dense that h(E, Cl) < ho := 
min(/i 2 , hf). We note that ho depends only on d and m (because this is true for /12 and ho). 

This permits us to understand the structure of around the centers ( £ H. Namely, because xd£) = 1, 

2 

Xd x ) > ^ for x in £(£, 7 ?) 

whenever < 1/(3 Cp m ~ d ^ 2 ). For the off-center case (i.e., when £ f £), 

\Xq(x)\ <Cp m ~ d/2 ^ e , for x in B(£, yg) (B.3) 

with a constant C = C(d,m). 
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Now fix 0 < 7 < 1/(3 Cp m 'V 2 ) 1 / 6 and define := fl n B(^'yq). The above estimate guarantees that 
0 ( 7 q) d < vol (B$), and 

a(iq) d < j B IXf(z)| p dx => a{^q) d a i\ P <J2 J B l ffl «X|(a)l p dx. 

This is the starting point for the corresponding lower bound to Lemma |B.1[ since the quasi-triangle inequality 
(■A + B)p < 2p~ 1 (Ap + Bp) implies that |a£Xf(x)| p < 2 P_1 (| E^es a CXc(®)l p + I Ec#? a CXc( a; )l P ) and so 


«(79) d f|VEl a «l P ^ 2P_1 I 

:/( 

^2 a cxd x ) 

V 

+ 

E a acW 

dx. 

(B.4) 



Ces 


c,+t 

) 



The desired lower bound is Cig d / p ||a||^ p ( S ) < ||s|| p , so we must estimate the size of the overestimated “off- 
diagonal” terms: 2 p - 1 J2^ e sf B( I E^« a CXc( a: ) ^de¬ 


controlling the off-diagonal terms 


y 

This is done in two stages, by splitting E^eH In E{ce=-c^£} a CX(i x ) dx = E^es^ + IIt) where the 

p 

first term is the “far away” contribution I^ := f B E{^g~- |c-{|>rq} a CX(( x ) da: and the second is the 

p 

nearby contribution 11^ := J B E{<;e=- |c-CI<r<?} a CXc(e) dx. These depend on an (as yet) undetermined 
parameter T > 0. First we use the exponential decay of (A.4) to control the far away portion of the 
off-diagonal part, Then we use the Holder estimates (B.3) to bound the nearby portion. 

Lemma B.3. For every p £ [1, oo) there is a function F : (0, oo) — > R. satisfying lirn^oo F(t) = 0 so that 
for every T > 0, we have the inequality 


E 

«es' 




E a CX(( x ) 

fees: |C—Cl>r<j} 


dx<F(r)( 7 g)V (m ' 2) l|a||£ 




holds with F( T) < Ce * pr with C = C{m,d,p) and p = p(d,m) the constant from (3. 41- 
Proof. We sum over the non-overlapping dyadic regions 

n k ■■= fifc(0 := {C e S I r2 k q < dist(£, 0 < T2 k+1 q}, k = 0,1,..., N q , 


where 2 Nq ~ dla ™(n) . rp^jg meang that, for M k := f B E^eOt a (X(( x ) 


dx, 


N„ 


4 <E 2(P 


-l)(fc+l) 


fc =0 


/B £ 


E °cxc(*) 

C6 W 


N a 


dx = E 2 ( P -i)(fc+i ) Mky 


k —0 


where the above inequality follows by iterating the quasi-triangle inequality | A + B\ p < 2 P 1 (|^4| p + \B\ P ) to 

get|E"=i^r<E^ 1 2^- 1 )|A,| p . 

We now estimate the contribution from each M k , the portion of II £ coming from the dyadic interval £l k - 
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By using the (generalized quasi-triangle) inequality | 1 A?'I P < n p 1 \Aj\ p , we have 

M k < (#Q. k ) p ~ l x V [ |a f Xf(a:)| p da; 

,-^n JBe 


Cent B s 


< (#n k ) p 1 x max ||xcIIl i( b s) x E l a <l* 


CeW 


< (K{2 k+1 T) d ) P C( 7 9 )V (m “ 2 ) (exp(- w r2 fc )) E |a c | p - 

Cen fc 


(B.5) 


In the final line, we have used the estimates (B.2) and (A.4). 

Multiplying by 2( p_1 ^ fc+1 ^ and summing from 0 to N q , we obtain (after rearranging some terms and 
combining constants which depend only on d and p) 


d ( N « 

I £ < C(jq) d p p(m -^ ^(2 k ^r d ) p -^M-^2 k ) 

\k =0 


We can now sum over £, obtaining 


N a 


Y,h<c{ iq ) d P p(m -2 } ^ Hd+1) ^ d r-^M-^2 k )Y: 




,fc=o 

' N a 




E kl p 

Cen k 


E kl' 


<C{'yq) d p p(m -2 ) ^2(2 k ( d+1 ' ) r d ) p - 1 exp(-ppr2 k ){K(2 k+1 T) d ) 


,k=0 


El«cl p 

Ces 


< C 


(7 q) d p P(m -i 

r p 


/ N a 


^( 2 fc r )(d+i) P ex p(- A tpr2 fc ) 


,k=0 


E kl” 

css 


In the second inequality, we have exchanged summation over £ and £. In short, we have used 


E E iki’] = EE 

CesCeW C 6 =«es 


Xn*tt)(0 E N* 

CeW 


in conjunction with the estimate #{£: C € f2fc(£)} < A"(2 ;c+1 r) d obtained from (B.2), since for ( £ S, 
#{£: £ £ flfc(£)} = #£l k (Q. In the final inequality, we have used the fact that 2 ( fc + 1 ) d < 2 fc ( d+1 ) x 2 d+1 and 

i -p(d+l)p 

that r p = —. We estimate this with an integral as 

g/£ < C (r dp exp(-ppT) + E J\xp(-ppry d +Ap-^ ( 7 g)V (m -^l|a||^ (H) . 

Which shows that F(T) := C (T dp exp(—ppT) + ^ f£° exp(—ppr)r( d+ 1 ) p_ 1 dr). 

The integral term can be bounded by making a change of variable J? = tT as 


2 ' , °° 
p P 


/»oo /»oo 

J exp(-ppr)r {d+1)p ~ 1 dr = 2T dp J exp(-ppTR)R^+^^dR < C , ri:P . m r rfp exp(- W r). 
Because maxr>i T dp exp(— ^pT) < Cd, P ,m, the estimate F(F) < Ce~? pT follows. 


□ 
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Lemma B.4. For every p £ [l,oo) and every T > 0, we have the inequality 


E 


ce 


~JB f 


E a c xd x ) 

{CeH: e^C,IC-CI<r g } 


da; < C(rV) P (79)V (m "2 




"4(H) 


Proof. Note that #{£ eS: C 7 ^ Cj 1C — £1 < Tg} < KT d , so by the quasi-triangle inequality, we have, for 
each C € S 


//« < / (AT^p - 1 £ |o c xc(*)| p da: 

dist(C,£)<r g 

< f {KT d ) p - 1 (C P m - d/ 2 'y e ) p E M P da; 

dist(c,5)<r g 

< Cr d ( p - 1 '>'y ep fP( m - d / 2 \'yq) d ^ |a f |4 

dist(c,5)<r 9 


In the first ineq uality we use the estimate on the number of centers ( |B.2[ ). In the second inequality, we use 
the bound ( B.3| ). The third inequality follows from the simple estimate vol(He) < C(-yq) d . 

Summing over £ £ 3, we obtain; 

E /7 c ^ E l«(Or < c(rV) p (79)V (m -^l|a|| p p(H) . 

Ce s 4eHdist(c,5)<r g 

The final estimate results by exchanging the two summations, and employing the fact that £ 5: dist(C, £) < 
Tq} < KT d . This completes the proof of the lemma. □ 

Lemma B.5. There exists a constant 7 satisfying 7 > Cp ^ (log {p))~ d ^ e with C(d,m,p,e), so that 

2 p_1 E / |E°<x<(*) P(ix - b a ( 79 ) d (I) Ei a «i p 

Ce= B £ c^C ' ' Ces 

holds for all a € £ P (S) and all p £ [ 1 , 00 ). 

Proof. By the quasi-triangle inequality, we have 


E / |E°cx«( a 

c^c 


da; < 2 P 


-1 


E^ + E^ 

UeE Ces 


Apply Lemma 
in Lemma B.3 


B.3 


and choose T so that Ce ‘i pT = \a {\) P p p ^ m 2 \ where C is the constant appearing 


We note that our choice of T guarantees T < Cd, P ,m log(p). By Lemma B.3 F(T) < 


|a(|) P p p( ' m 2 \ it then follows that 


E 


Ce 




E a cxd x ) 

IC 6 H: |C-CI>r 9 } 


dx< 4 -^Q) P ( 7 9 ) d EKI P - 

V J Ces 


Now select 7 so that both 0 < 7 < 1/(3 Cp m and C'(r d 7 e ) p < (|) P p p ^ m d hold. The 

problem of choosing 7 can be rewritten as 7 e < p d / 2 “ m min(g^, C c i,m,p^'~ d )- Since T _d > C'rf iPim (log(p)) _d , 
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it suffices to take j e = Cd,m, P p d ^ 2 m (log (p)) d for some constant Cd t m, P 
guarantees that 


For this choice of 7 , Lemma 


B.4 



{CCS: 


p 


E a cxd x ) 

C^S,IC-d<r 9} 


dx < 4 p a 



(79) d EKI P 


as well. 
Thus, 


2 p 


- X E 

ees- 


s(?,7 1) 




dx < 4 p- 1 + n i) < 



and the result follows with 7 > Cd, m ,p,eP d (log(p)) . 


□ 


Lemma B.6. Suppose XI is a bounded domain and S C 14 is a finite subset with fill distance h < h$ := 
min(/i2, /13), where ho = ho(m,d). There exists a constant then the family of functions (Xf)les have the 
property that for any s = a fXf> 

ciq d,v \Mi p (~) < IMIp. 

d(d — 2m) d? 

holds with Ci > Cp ^ (log ^)) - ? 7 with 0 < e < m— d/2 and C = C(d,p,m,e). 


Proof. Since s(£) = aj, the L^ case follows immediately with constant 1. For 1 < p < 00 we use (B.4) and 
Lemma IBJ5I to make the estimate 



Applying J B 


E f eH a C xd x ) <f n E (e s a CXd x ) 


V 

dx , the result follows with 



(«7 d ) 1/p > C d ,m,p,eP 


d(d — 2m) 
2 ep 


d 2 

(log (p))~^- 


□ 
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